FIGURE l 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGG TC TGAAGAC TAACAT T T T GT GAAGT T GTAAAACAGAAAAC CT GTTAGAAATGTGGTGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
TCATACAT TACT GCAGTAACACT CCACCATATAGACCCGGC TTT ACCT TATATCAGT GACAC TGG 
T ACAGTAGCTCCAGAAAAAT GCT T AT T TGGGGCAATGC TAAATAT T GCGGCAGTT TTATGCAT TG 
C T ACCAT T TATGT T CGTTATAAGCAAGT TCATGC T CT GAGT CCT GAAGAGAACGT TAT CATCAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
?i TATATATGT T TGT T CAGACCATCCT T T CCTAC CAAAT GCAGCCCAAAATCCATGGCAAACAAGT C 

||| TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
III AT CAGT T TT GCACAGTGGCAATT T TGGGAC T GAT T T AGAACAGAAACTC CAT T GGAACCCCGAGG 

ACAAAGG TTATGT GC T TCACATGAT CAC TACT GCAGCAGAAT GGT C TAT GTCATT T T CC T TC T T T 
GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 
T GGAT TAACCCTCTATGACACTGCACC T TGCCCTAT TAACAATGAACGAACACGGCT AC T T T CCA 
Q GAGAT AT T TGATGAAAGGATAAAATAT T TC TGTAATGAT TAT GATTCT CAGGGAT TGGGGAAAGG 

HI T TCACAGAAGT T GCT TATT C TTCT CT GAAAT T TT CAACCAC T TAAT CAAGGC TGACAGTAACACT 

JjJ GATGAATGCT GATAATCAGGAAACAT GAAAGAAGCCAT T TGATAGATTAT TC TAAAGGATAT CAT 

d«» CAAGAAGACTAT TAAAAACACCTATGCCTATACTT TT TTATCTCAGAAAATAAAGTCAAAAGACT 

fli ATG 



[ft 
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FIGURE 2 

<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 
LCIATIYVRYKQVHALSPEEOTIIKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVSGAVLTFG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 
LLSRDI 



jp Important features: 

01 Type II transmembrane domain: 

Y$ amino acids 13-33 

m Other Transmembrane domains: 

B amino acids 54-73, 94-113, 160-180, 122-141 

N-myristoylation sites. 



Hi 



amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE a 

CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATACTATTTTTTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
T TTT CAT GGTGCCTT T TTACATTGGCTATTT TATT GT GAGCAATATCCGACT ACTGCATAAACAACGA 
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
CATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAG 
TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
CTCAGGAAT GT GACTGACACGGATATTCTAGCCCT GGAACGGCGACTGCTGCAAACCAT GGATATGAT 
CATAAGCAAAAAGAAAAGGATGGCAAT GGCACGGAGAACAATGTT CCAGAAGGGGGAAGT GCATAACA 
AACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGGAAGTGAAAATCTTACT 
CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT 
ATAT GCTACCAAGGAGAGAATAGAATACTCCAAAACCTTCAAGGGGAAATAT TTTAATTTTCT TGGT T 
ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
GGGAAAACGGAT CCTGTCACAAGAGGCATTGAGAT CACT GTGAATTATCTGGGAAT CCAAT TT GATGT 
GAAGT TTTGGT CCCAACACATTTCCTT CATT C TTGTT GGAATAAT CATCGTCACAT CCATCAGAGGAT 
TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAAT ACCGCACCATAATCACTGAAGTCCTT GGAGAACTGCAGT T CAACTTCTAT CACCGTTGGTTT G 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA 
GAT ATAAGAGGGGGGAAAAAT GGAACCAGGGCCTGACATTT TATAAACAAACAAAATGCTATGGTAGC 
ATT TT TCACCTT CATAGCATACTCCTT CCCCGTCAGGTGATACTATGACCAT GAGTAGCAT CAGCCAG 
AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCATCCCGT GTGGATAT GAGGCTGG 
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
CTATGGTAGCTGAGCCAAACACGTAGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTG 
CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 



10063716 



Page 157 



FIGURE 4 

msflidssimitsqilffgfgwlffmrqlfkdyeirqywqvifsvtfafsctmfeliifeilgv 
lnsssryfhwkjytnlcvillilvfmvpfyigyfivsnirllhkqrllfscllwltfmyffwklgdp 
fp i lspkhgi ls ieql i srvgvi gvtlmallsgfgavnc pyt yms y flrnvtdt di lalerrllq 
tmdmiiskkkrmamj^r™fqkgew^ 

lfletadlyatkerieysktfkgkyfnflgyffsiycvwkifmatinivfdrvgktdpvtrgiei 
tvnylgiqfdvkfwsqhisfilvgiiivtsirgllitltkffyaissskssnvivlllaqimgmy 
fvs svllirmsmple yrt 1 1 tevlgelqfnfyhrwfdvi flvsalss i lflylahkqapekqmap 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobal ami n -binding proteins 

amino acids 151-160 
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FIGURE s 



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACT CTCCAT CCGGACTAGT TAT TGAGCATCT GCC T C T CAT AT CACCAGTGGCCAT C TGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACG ATG GCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCATGCAGAAT TATGGGGATCACCCTTGTGAGCAAAAAGGCGAACCAGCAGCTGAATT TCACAG 
AAGC TAAGGAGGCCT GTAGGCT GC T GGGAC TAAGT TTGGCCGGCAAGGACCAAGT T GAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAA 
AT TATCACCACCAAAGATCCCATATT CAACACT CAAACTGCAACACAAACAACAGAAT T TAT T GT 
H CAGTGACAGTACCTACTCGGTGGCATCCGCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
Q C T GCTCCAGCT T CCACT TC TAT TCCACGGAGAAAAAAAT TGATT TGTGTCACAGAAGT T T TTATG 

GAAACTAGCACCATGTCTACAGAAACTGAACCATT TGT TGAA?^ATAAAGCAGCATT CAAGAATGA 
SJ AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
f] CAGCTGGTC T TGGAT T T TGC TAT GTCAAAAGGTAT GTGAAGGCCT T CCC T T T TACAAACAAGAAT 

III CAGCAGAAGGAAATGATCGAAACCAAAGTAGTAAAGGAGGAGAA.GGCCAATGATAGCAACCCTAA 
SJ T GAGGAATCAAAGAAAACT GATAAAAACCCAGAAGAGTCCAAGAGT CCAAGCAAAACT ACCGTGC 

| :: f ;: GAT GCC TGGAAGCTGAAGTT TAGATGAGACAGAAATGAGGAGACACACC TGAGGCT GGT T TCT T T 

CAT GCT CC T TACCCT GCCCCAGC T GGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 
u " CCCTTGGTTCCTAACTGGAATCAGCTCAGGACTGCCATTGGACTATGGAGTGCACCAAAGAGAAT 
* GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
Q T TCTAGCC TGGC TATGTCC TAATAATATCCCACT GGGAGAAAGGAGT T T TGCAAAGT GCAAGGAC 

pi CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
iSi AAAGCCAAGGAGTCAC TGAGACCAAGGC TT TC TC TACTGAT TCCGCAGC TCAGACCC TT TCT T CA 

b* . GCTCTGAAAGAGAAACACGTATCCCACCTGACATGTCCTTCTGAGCCCGGTAAGAGCAAAAGAAT 
W GGCAGAAAAGTTTAGCCCCTGAAAGCCATGGAGATTCTCATAACTTGAGACCTAATCTCTGTAAA 
CJ! GCTAAAATAAAGAAATAGAA£AAGGCTGAGGATACGACAGTACACT 

HI ACAGACAGGGTCAAAGTGT TT TCTCTGAACACAT TGAGTTGGAATCACTGTTTAGAACACACACA 

CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 
AAAATAAAAACT CTTATAAATTTCTATT TT TATC TGAGT TACAGAAAT GAT TACT AAGGAAGATT 
AC TCAGTAAT TT GT T TAAAAAGTAATAAAAT T CAACAAACAT T TGCTGAATAGCT ACTATATGT C 
AAGT GC TGTGCAAGGTATTACACTCT GTAAT T GAATAT TAT TCCT CAAAAAAT TGCACATAGTAG 
AACGC TATC TGGGAAGC TATT TT T T T CAGT T T TGATAT T TC TAGCT T AT C TACTT CCAAAC TAAT 
T T T TAT T TT TGCT GAGAC TAATCT T AT TCAT T TT C T CTAATATGGCAACCAT TATAACC T TAAT T 
TAT TAT TAACATACCTAAGAAGTACAT TGT TACCT CTATATACCAAAGCACAT TT TAAAAGT GCC 
AT TAACAAATGTATCAC TAGCCC TCCT T TT TCCAACAAGAAGGGACT GAGAGATGCAGAAATAT T 
T GT GACAAAAAAT TAAAGCAT TTAGAAAACT T 
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FIGURE 6 



MARCFS LVLLLT S I WT TRLLVQGSLRAEELS I QVSCRIMG I TLVSKKANQQLNFTEAKEACRLLG 
LSLAGKDQVETALKASFETCSYGWGDGFWISRISPNPKCGKNGVGVLIWKVPVSRQFAAYCYN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKLICVTEVFMETSTMSTETEPFVENKAAFKNEAAGFGGVPTALLVLALLFFGAAAGLGFCYVK 
RYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 

Signal sequence : 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site. 

amino acids -53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAA.T T GC GCAGCGCGGT GGAAGAGATGGAGGCAGAAGAAGCTGCTGC T A 
AAGCATCATCAGAAGTGAACC TGGCAAAC T TACCTCCCAGC TATCACAATGAGACCAACAGAGAC 
AC GAAGGTT GGAAATAATACCAT CCAT GTGCACCGAGAAAT TCACAAGATAACCAACAACCAGAC 
TGGACAAA.TGGT CT TT T CAGAGACAG T TATCACAT CT GTGGGAGACGAAGAAGGCAGAAGGAGCC 
ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
£3 ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
O GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
f|! ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
m ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
%l CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
l2 CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
GAGATCCTGCTGCCCAGAGAGGTGCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 
Ut CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
s CCGCCGCTGCACTGCTGGGAGGGGAAGAGATT TAGA TCTGGACCAGGCTGTGGGTAGATGTGCAA 
C! TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
HI CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
g| CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
m GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
f?| T T T GT TC TACAT GGCT TTGATAAT T GT TTGAGGGGAGGAGAT GGAAACAATGTGGAG T CT CCC T C 

|y TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGT CACACAGCTAGTGAAGACCAGAGCAGT T TCATC TGGT T GT GACT C TAAGC TCAGTGCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
TT T TCT T GAGGCAT GCACATCTGGAATTAAGGT CAAAC TAAT T CT CACAT CCCT CTAAAAGTAAA 
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 
AT T GACACTGT CCCTCT T TGGCAGT T GCATTAGTAACT T TGAAAGGTATATGAC TGAGCGTAGCA 
TACAGGT TAACCT GCAGAAACAG TACT TAGGTAAT T GTAGGGC GAGGAT TATAAAT GAAAT TT GC 
AAAATCAC T TAGCAGCAAC TGAAGACAAT TAT CAACCACGTGGAGAAAAFCAAACCGAGCAGGGC 
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CATGAT TGTATAAGCAT GCT T TCT T TGAGT T T TAAAT TATGTATAAACATAAGT TGCAT T T AGAA 
AT CAAGCATAAAT CAC T TCAAC T GCAAAAAAAAAAAAAZUIAAAAAAAAAAA 
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FIGURE 8 

MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKAS SEVNLANLPP S YHNETNTDTKVGNNT I HVHRE IHKI TNNQT GQMVFSE 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFAS FQYTCQPCRGQRMLCTRDSECCGDQLCWGHC 
TKMATRGSNGTICDNQRDC.QPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEEI 

Signal sequence : 
amino acids 1-19 

N-glycosylation site, 

amino acids 96-100, 106-110, 121-125, 204-208 
Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 

N-nryxistoylation site. 

amino acids 202-208, 217-223 

Amidation site. 

amino acids 140-144 
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FIGURE Q 

CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATAT TGACCACAAGC TCAGAGGAGTGCAGAACCC T GTAGCCCGC TGCATCATGTGC TGTTT 
CAAGTGC TGCCT C T GGTGTC TGGAAAAATT TAT CAAGT TCCTAAACCGCAAT GCATACATCATGA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
AGAT TC T GGGCAAGAAGAACGAGGCGCCCCC GGACAACAAGAAGAGGAAGAAGTGACAGC TCCGG 
CCGTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
TTGAGAGGCT GAGGCGGGCGGAT CACCT GAGT CAGGAG T TC GAGACCAGCC TGGCCAACATGGT G 
AAACCTCCGTCTGTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GAT CGCGCCACT GCAC TCCAACCT GGGTGACAGAC T CTGTC T CCAAAACAAAACAAACAAACAAA 
AAGAT T T TAT T AAAGATAT T T TGT TAACTC 
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FIGURE 10 



RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NOTLALGQCVLAGAFAS FYWAFHKPQDI PT FPLI SAFIRTLRYHTGSLAFGALILTLVQIARVIL 
EYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAKNAFMLLMRN 
IVRWVLDKYTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNWGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites* 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 

amino acids 276-288 

Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
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FIGURE 11 



Id 



a ir'S 
ill 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTGCTGGAACCAGCGGT 
GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
T T CTACT T G CT GT CGAT CGCGGCC GT G GC G C T GAT G T TCAT GT AC TACAC T GAG C C CAGC GGC T GC CA 
CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 



hh CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
If! TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAA.CAGAAATGCAACCCCCATTTGCCAACCCAGCT 
B - GGG CAACGAGACAGT T GT GGCAGGC CC C GAGGGCT AT GAGACC CAGT G GT GGGAT GC C C C GAGCAT T G 

TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
AACAGCCT GAT GCAGACCGAGGAGTGC CCACCTAT GCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 
m GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
r| ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 
f|| ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGC TCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GAC TTCGTGCCT T AC T G AGT C TC T AAG AC T T T T T C T AAT AAAC AAGC CAGT G C GT G T AAAAAAAA 
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FIGURE 12 



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLPWVCEEGAG I PT VLQGH I DCGS LL G YRAVYRMC FATAAFFFFFFTLLMLCVS S SRDPRAAI Q 
NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLSIAAVAmFMYYTEPSGCHEGKVFISLNLTFCVCVSIAAVLPKV 
QDAQPNSGLLQASVITLYTMFVTWS7VLSSIPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 
VGLI I FLLCTLFI SliRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYS Y 
SFFHFCLVIASLHVmTLTNWYKPGETRKMISTWTAWVKICASWAGLLLYLWTLVAPLLLRNRD 
FS 

Signal sequence : 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE is 



id 



01 



CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGT TGGAAAAAGACT CC TGTAACCCT CC TCCAGGATGAACCACC TGCCAGAAGACAT GGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
C T CATGGCCAGGATT GAGTCCTAT GAAGGAAGGGAAAAGAAAGGCAT ATC TGATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
TGAATGGAGGCATTGAGAACACAT TAGAGAAGGAGGTGATGCAGTATGACTACTATTCT TCATAT 
T T T GATATAT TT CT TCT GGCAGT T T T TCGAT T TAAAG T GTTAATAC T TGCATATGC T GT GTGCAG 
ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
CTT GCCTGGATT GAGACGTGGT T CCTGGAT T TCAAAGTGTTACC T CAAGAAGCAGAAGAAGAAAA 
CAGACTCCT GATAGTTCAGGAT GCTTCAGAGAGGGCAGCAC T T ATACC TGGT GGTCT TT CTGAT G 
GTCAGTTTTATTCCCCTCCTGAATCCGAAGCAGGATCTGAAGAAGCTGAAGAAAAACAGGACAGT 
GAGAAACCACTTTTAGAACT ATGA GTAC TACTTT TGTTAAATGTGAAAAACCCTCACAGAAAGTC 
ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 
CACTGC TGGCTTTATTGAACAGCTAATAAAGATT TATT TATTGTAATACCTCACAAACGT TGTAC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 
AAAAGACTTAATATATTGAAGTAACACT TTTT TAGTAAGCAAGATACCT TTTTATTTCAATTCAC 
AGAATGGAATTTTTTTGTTTCATGTCTCAGATTTATTTTGTATTTCTTTTTTAACACTCTACATT 
TCGCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
CT TGCAAAAGGGGAAGAAAGGAAT T GCGAATACATGTAAAATGTCACCAGACATT T GTAT TAT T T 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGAC TTAAACCAT TCATAT CAT GT T T CCT T TGCGT TCAGCCAAT TTCAAT TAAAATGAA 
CTAAATTAAAAA 
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MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLW 1 1 ELNVNGGI ENTLEKEVMQYDY YS S YFD I FLLAVFRFKVL I LAYAVCRLRHWWAI ALT T 
AVTSAFLLAKVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AAL I PGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 



gl Transmembrane domains: 

jUJ amino acids 54-72, 100-118, 130-144, 146-166 

^ N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 



n; 
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ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGC.CAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTAGGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
U CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
f;l GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
S;f CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
W| TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
h! TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
\f TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
ij" CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
L.,'- CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
• CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCGCAGGACTGCCCACCGTCCACCTGCCTCA 
Q ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
|p GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 
S| GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
W AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
s&l GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
g]f GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
fill GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 
GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACG TAAGT TCTCAGTCCCAACC T CGGGGAT GTG TGCAGACAGGGCTGT GTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGCT GT GT GACT C TAGT CT T GGCCC CAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGC 
TT TAGGAACATGT TTTGCTT TTTTAAAATATATATATT TATAAGAGATCCTTTCCCATTTATTCT 
GGGAAGATGT TT T TCAAAC T CAGAGACAAGGACT TTGGTTTTT GTAAGACAAACGATGATATGAA 
GG C C T T T T GTAAGAAAAAAT AAAAGAT GAAGT G T GAAA 
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FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGIAALQELDVSNLSLQALPGDLSGLFPRLRLLAAAR^ 

TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 

SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 

VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 

TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 

LLAAIAAVGAAYCVRRGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGS 

CEVPLMGFPGPGLQS PLHAKPY I 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTC ATG CG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
AT GAAGAAGGCAGCCAGGAT GAATCC T TAGAT TC CAAGAC TACT T TGACATCAGAT GAG TCAGTA 
AAGGAC CATACTACT GCAGGCAGAGTAGTTGCTGGT CAAATATT T CT T GAT TCAGAAGAATC TGA 
AT TAGAATCCT CTAT T CAAGAAGAGGAAGACAGC C TCAAGAGCCAAGAGGGGGAAAGT GTCACAG 
AAGATATCAGCTTTCTAGAGTCTCCAAATCCAGAAAACAAGGACTATGAAGAGCCAAAGAAAGTA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
T T T CCTAGATAAGGAGTAT GAT GAAT GTACATCAGATGGGAGGGAAGATGGCAGACTGTGGT GT G 
C TACAACC TATGACTACAAAGCAGAT GAAAAGT GGGGCT TT TGT GAAACTGAAGAAGAGGCT GCT 
AAGAGACGGCAGATGCAGGAAGCAGAAATGATGTATCAAACTGGAATGAAAATCCTTAATGGAAG 
CAATAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 
CCAAAGCCC TGGAGAGAGTGTCATATGCTCTTTTATT TGGTGAT TACT TGCCACAGAATATCCAG 
GCAGCGAGAGAGATGT TTGAGAAGCTGACTGAGGAAGGCTCTCCCAAGGGACAGACTGC TCTTGG 
CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
Q TT GGAGC T C TTGGGGGCAAT CTAAT AGCCCACAT GGT T T TGGTAAGTAGAC T T TAG T GGAAGGCT 

Ul AATAATATTAACAT CAGAAGAAT T T GTGGT T T ATAGCGGCCACAACT T T TT CAGC TT TCATGATC 

ijj CAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAAATTCTTGTTAATGGATAT 
fa AACACATGGAATC TACATGTAAAT GAAAGT T GGT GGAGTCCACAATT TT TC T T TAAAAT GAT TAG 

Hi TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 
GAATTGTCAGAATCATTTTT TACATTAGATTATCATAATTTTAAAAATT TT TCTTTAGTTTT TCA 
AAATTT TGTAAATGGTGGCTATAGAAAAACAACATGAAATAT TATACAATATTTTGCAACAATGC 
CCTAAGAAT T GT TAAAAT T CATGGAGT TATT T GT GCAGAATGAC T CCAGAGAGCT C TAC T T TC T G 
T TT TT TAC T T TT CATGAT T GGCTGT CT T CCCAT T TAT T CTGGT CAT T TAT TGC TAG T GACAC TGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGATTAAC T CATT TT TAATAAAAT TAT GTC TAAGAT TAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



m 
si 

hi 
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MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEMMYQTGMKILNGSNKKSQKR 
EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S SQAKALVY YT FGALGGNL I AHMVLVS RL 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C-terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen -binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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AAT T CAGAT T T T AAGC CCAT TC T GCAGTGGAAT T T CAT GAAC T AGCAAGAGGACACC ATC T TC T T 
GTAT TATACAAGAAAGGAGTGT ACCTATCACACACAGGGGGAAAAATGC TC T T T T GGGTGCTAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACATTTTTATCACTGGATGTGACTCGGGCTTTGGAAACTTGGCAGCCAGAACTTTTGAT 
AAAAAGGGAT T T CAT GT AAT CGC TGCCTGTCT GACTGAATCAGGAT CAACAGCTT TAAAGGCAGA 
AAC CTCAGAGAGACT TCGTACT GT GCT T CTGGAT GTGACCGACCCAGAGAATGTCAAGAGGAC TG 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
CCCGGCGTGCTGGCTCCCACTGACTGGCTGACACTAGAGGACTACAGAGAACCTATTGAAGTGAA 
C CTGT T TGGACT CAT CAGT GTGACACTAAATAT GCTT CCT T T GGTCAAGAAAGCT CAAGGGAGAG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 
ATGCAT TGAACCAGGATTGT TCAAAACAAACTTGGCAGATCCAGTAAAGGTAATTGAAAAAAAAC 
T CGCCATT TGGGAGCAGCT GTCT CCAGACATCAAACAACAATAT GGAGAAGGT TACAT T GAAAAA 
AGTCTAGACAAACTGAAAGGCAATAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTG 
CATGGACCACGC TC TAACAAGT CT CT TCCC TAAGACTCATTATGCCGC T GGAAAAGAT GCCAAAA 
TT TT CT GGATACCT CT GTC T CACATGCCAGCAGC T T T GCAAGAC T T T T TAT TGTTGAAACAGAAA 
GCAGAGC TGGCTAAT CCCAAGGCAGT GTGAC TCAGCTAACCACAAATGTCTCCTCCAGGC TATGA 
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
CAT TTAGATCGT GCT TATT T GGAT TGCAAAAGGGAGT CCCACCAT CGC TGGTGGTATCCCAGGGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCATCTCTTATCTAAATATTAAAAGATAAGTCAACCCAAAAA7VAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 
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MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 

STALKMTSERLRTVLLDVTDPENVKRTAQWKNQVGEKGLWGLINNAGVPGVIAPTDWLTLED^ 

REPIEWLFGLISVTLmLPLVKKAQGRVINVSSVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 

AFGVHVSCIEPGLFKTNLAI)PVKVIE 

LSPWECMDHALTSLFPKTHYAAGKDAKIFWIPL^ 

Important features of the protein : 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41/ 42-47/ 108-113, 166-171, 198-203 and 207-212 
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CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGT GAAGCCAAGAACAGCAT T AC T GAT TCC CAAAT GGATGATGT TGAAGT T GT TTATACAAT TGA 
CATTCAGAAATATAT TCCATGCTATCAGCT TTTTAGCT T TTATAAT TC TTCAGGCGAAGTAAATG 
AGCAAGCACTGAAGAAAATATTATCAAATGTCAAAAAGAATGTGGTAGGTTGGTACAAATTCCGT 
CGTCATTCAGATCAGATCATGACGTTTAGAGAGAGGCTGCT TCACAAAAACT TGCAGGAGCATTT 
TTCAAACCAAGACCTTGT T T T TCT GCTATTAACACCAAGTATAATAACAGAAAGCT GC TCTAC TC 
PI ATCGACTGGAACAT TCCTTATATAAACCTCAAAAAGGACTTT T TCACAGGGTACCT TTAGTGGTT 

f"l GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
T T T TAGCCGAGCAGTACAAACACACAGC TC TAAAT T T T TTGAAGAAGATGGATCCT TAAAGGAGG 

1 *5 

T ACATAAGATAAAT GAAATGTAT GC TT CAT TACAAGAGGAAT TAAAGAGTATATGCAAAAAAGTG 
. ~f GAAGACAGTGAACAAGCAGTAGATAAAC TAGTAAAGGATGTAAACAGAT TAAAACGAGAAAT TGA 

g?| GAAAAGGAGAGGAGCACAGAT TCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCC T CAGGAGA 

3 ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
Q AT GT C T T TAAAAAATAGACATGTT TCTAAAAGTAGCTGTAACTACAACCACCATCT CGATGTAGT 

| Sift 

|^ AGACAATCT GACCT TAATGGTAGAACACAC TGACAT TCC TGAAGCTAGT CCAGCTAGTACACCAC 

S AAATCAT TAAGCATAAAGCC T TAGACT TAGAT GACAGAT GGCAAT T CAAGAGATCTCGGTTGT TA 

pi GATACACAAGACAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAAT 
f||; GAGCAGCCCAGAAACAGATGAAGAAAT TGAAAAGATGAAGGGT TT TGGTGAATATTCACGGTCTC 

CTACAT T T TGAT CCT T T TAACCT TACAAGGAGAT T T T T TTAT T TGGCTGATGGGTAAAGCCAAAC 
AT T TCTAT TGTT T T TAC TATGTT GAGC TAC T T GCAGTAAG T TCAT T T GT T T T TAC TATGT TCACC 
TGTTTGCAGTAATACACAGATAACTCT TAGTGCAT TTACTTCACAAAGTACTTTT TCAAACATCA 
GAT GCT TT TATT T CCAAACC T TT T T TT CACC T T TCACTAAGT TGTT GAGGGGAAGGCTTACACAG 
ACACAT TCT T TAGAAT T GGAAAAGT GAGACCAGGCACAGTGGCT CACACCTGTAATCCCAGCACT 
TAGGGAAGACAAGTCAGGAGGAT T GAT TGAAGCTAGGAGT TAGAGACCAGCC TGGGCAACGTAT T 
GAGACCATGTCTATTAAAAAATAAAATGGAAAAGCAAGAATAGCCT TATT TTCAAAATATGGAAA 
GAAAT T TATAT GAAAAT T TATC T GAGT CAT TAAAAT TC TCC T TAAGTGATAC T T T T T TAGAAGTA 
CAT TATGGC TAGAGTT GCCAGATAAAAT GC T GGATATCATGCAATAAAT T TGCAAAACAT CATCT 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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MEGES TSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNS I TDS QMDDVEWYT I DI QKYI 
PCYQLFS FYNS SGEVNEQALKKI LSNVKKNWGWYKFRRHSDQIMT FRERLLHKNLQEHFSNQDL 
VFLLLTPS 1 1 TESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKT VSGSCMSTGFSRAV 
QTHSSKFFEEDGS LKE VHK I NEMYAS L QE E L KS I CKKVE D S E QAVDKL VKDVNRLKRE I E KRRGA 
QI QAAREKNI QKDPQEN I FLCQALRT FFPNSE FLHSCVMS LKNRHVSKS SCNYNHHLDWDNLTL 
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEE IEKMKGFGE YSRS PTF 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation sites . 

amino acids 75-79, 322-326 

N-myristoylation site. 

amino acids 184-154 

Growth factor and cytokines receptors family. 

amino acids 134-150 
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GGCACAGC CGCGC GGCGGAGGGCAGAGT CAG CCGAGC CGAGT C CAGC CGGAC GAGC GGAC CAGC GCAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
|M. AACAAGAACGATGCCCCCCTC CACGAGATCAACGGT GAT CATTT GAAGATCT GT CC CCAGGGTTCTACCTGCTGCT CT 

13 CAAGAGAT GGAGGAGAAGTACAGCCT GCAAAGTAAAGAT GATTT CAAAAGT GT GGT CAGCGAACAGT GCAAT CATTTG 

CI CAAGCT GT CTT T GCTT CAC GTTACAAGAAGTTTGATGAATT CTT CAAAGAACTACT TGAAAATGCAGAGAAAT CC CTG 

fek AAT GATAT GTTT GT GAAGACATAT GGCCATTTATACATGCAAAATT CTGAGCTATTTAAAGATCT CT TCGTAGAGTT G 

til AAAC GT TACTACGT GGTGGGAAAT GT GAAC CTGGAAGAAAT GCTAAATGACTT CT GGGCT C GCCT C CTGGAGCGGAT G 

T T CC GCCT GGT GAACTCC CAGTACCACTT TACAGAT GAGTAT CT GGAAT GT GT GAGCAAGTATACGGAGCAGCT GAAG 
CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 

m 

*H T TAGCGGTT GCGGGAGAT GT C GT GAGCAAGGTCTCCGTGGTAAACCCCACAGCC CAGTGTACCCAT GCC CT GTTGAAG 

* ATGATCTACTGCTCCCACTGCCGGGGTCTCGTGACTGTGAAGCCATGTTACAACTACTGCTCAAACATCATGAGAGGC 
TGTTTGGCCAACCAAGG GGAT CT C GATTT TGAAT GGAACAATTT CATAGAT GCTAT GCTGAT GGT GGCAGAGAGG CTA 
W$ GAGGGTCCT TT CAACATT GAAT CGGTCATGGATCC CATC GAT GT GAAGATTT CT GAT GCTAT TAT GAACAT GCAGGAT 

1*1 AATAGTGTTCAAGT GTCT CAGAAGGTTTT CCAGGGAT GT GGACC CCC CAAGC CC CT C CCAGCTGGACGAATTTCT CGT 

fol T C CATCT CTGAAAGTGC CTTCAGT GCTCGCT T CAGACCACATCACCC CGAGGAACGCCCAACCACAG CAGCT GGCACT 

1*1 AGTTT GGACCGACT GGTTACT GAT GT CAAG GAGAAACTGAAACAGGC CAAGAAATTCTGGT CCT CC CTTCCGAGCAAC 

s M GTTT GCAACGAT GAGAGGAT GGCT GCAGGAAACGGCAAT GAGGAT GACT GT T GGAAT GGGAAAGGCAAAAGCAGGT AC 

CTGTTTGCAGTGACAGGAAAT GGATTAGCCAACCAGG GCAACAAC CCAGAGGTC CAGGTT GACACCAGCAAACCAGAC 
ATACT GAT CCTT CGTCAAAT CAT GGCT CTT CGAGT GATGACCAGCAAGAT GAAGAAT GCATACAATGGGAACGAC GT G 
GACTTCTT TGATATCAGTGAT GAAAGTAGT GGAGAAGGAAGT GGAAGT GGCT GT GAGTAT CAGCAGT GCCCTTCAGAG 
TTTGACT ACAAT GC CACTGACCAT GCT GGGAAGAGTGCCAAT GAGAAAGC CGACAGT GCT GGTGT C CGT CCTGGGGCA 
CAGGCCTACCTCCTCACTGTCTTCTGCATCTTGTTCCTGGTTATGCtAGAGAGAGTGGAGATAATTCTCAAACTCTGAG 
AAAAAGTGTTCATCAAAAAGTTAAAAGGCACCAGTTATC^CTTTTCTACCATCCTAGTGACTTTGCTTTTTAAATGAA 
T GGAGAACAAT GTACAGTTT TTACTAT GT GGCCACT GGTTTAAGAAGTGCT GACTTTGTT TT CT CAT TCAGTTT TGGG 
AGGAAAAGGGACTGT GCATT GAGTTGGTT C CTGCT CC CCCAAACCATGTTAAAC GT GGCTAACAGT GTAGGT ACAGAA 
CT AT AGT TAGTT GT GCATTT GTGATTT TAT CACT CTATTATTTGTT TGTAT GTTTTTTT CT CATT TC GTTTGTGGGTT 
T TTTTT T CCAACTGTGATCTCGCCTT GTTT CTTACAAGCAAACCAGGGT CCCTT CTT GGCAC GTAACAT GTACGTATT 
T CTGAAATATTAAATAGCT GTACAGAAGCAGGTT TTATTTAT CAT GTTAT CTTATTAAAAGAAAAAGCCCAAAAAGC 
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FIGURE 24 

MARFGLPALLCTIAVLSAALIAM 

CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSELFKDLFVELKRYYWG3SFVNLEEMLNDFWARLLERMFRLWSQYHFTDEYLECVSKYTE 
QLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDWSKVSWNPTAQCTHALLKMIYCSHCRGL 
VTVKPCYNYCSNIMRGCLANQGDLD^ 

MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPSNVCNDER3M^GNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 
KSANEKADSAGVRPGAQAYLLTVFC I LFLVMQREWR 

Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) • 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites . 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 

CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
CT GAGT CATCCCCAGGGAT CAGGAGCC TCCAGCAGGGAACCT T CCAT TATATTCT T CAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
TCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCATTCCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTT GTAG GAGCTCTG 
AGCGCCCACTCT TCCAATTAAACATTC TCAGCCAAGAAGACAGTGAGCACACCTACCAGACACTC 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGCTAGTGTCAT T TAACCT TAAATGCAATCAGGAAAGTAGCAAACAGAAGTCAATAAATATT T T T 
AAATGTCAAAAAAAAAAAAAAAAAA 
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FIGURE 26 



s 



:: E I! 



MKVLISSLLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 
amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 46-52 



APP ID=1 006371 6 



Page 180 of 322 



FIGURE 27 



GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
T GGCCGTGC TGACTCCCAAGAGCAACCGCAAGAT GGAGAGCAAGAAACGGGAGCT CT T CAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 

H GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 

O 

£\ CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 

fjk TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 

h| GAGCCCACC TGGAAACACAT TGGGGAT GGC T GC T GCCTCACCAGAGAGACCTGGAAGGAT C T TGA 

NJ GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 

;^ GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 

if;: 'H 

T T CCCCAGCC TCCAAT TAGAACAAGCCACCCACCAGCC TATC TAT CTTCCACTGAGAGGGACCTA 

s 

p| GCAGAATGAGAGAAGACAT T CAT GTACCACCTACTAGTCCC T CT C T CCC CAACCT CT GCCAGGGC 

ff|. AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
Cl AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 

ft? 6 

W CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
2? CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
ATGCCAGAGCAAGACTCAAAGAGGCAGAGGTTTTGTTCTCAAATATTTTTTAATAAATAGACGAA 
ACCACG 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQIJUDGSMDVWCTLVLCSVQSPRKVLQETORVLRPGGVLFFWEHVAEPYGSWAFMWQQVFEPTW 
KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQP I YLPLRGT 

Important features: 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-myristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 



CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCT ATGC TGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGTTGTTTATGATTCTTATACTAAT TTATACAAAGATAT TAAGGCCCTGTTCAT TAAGAAA.TT 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 



EM 
ri 

li 
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FIGURE so 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 



III 



Important features: 
Signal peptide: 

amino acids 1-15 

Growth factor and cytokines receptors family: 

amino acids 3-18 
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FIGURE 31 

GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
AT T TGCATCT GT T T TGATAAAT GAT GT T GACACCCTCCACCGAAT T CTAAGTGGAAT CATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
T GTCTGCGGT GT T T TATGGT GGC T GTAT TATGACTATACCAACGACCTCAGCATAGAAT TGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
fnh C T GCTCGTC T TGATTT T TGT T CT CAGAAAGAGAATAAAAT TGACAGT TGAGC T T T TCCAAATCAC 

W AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
J! TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
J!g ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
%l AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
M TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
W s TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
!L GATTCCGAGAAT CATT GTCATGTACAT GCAAAACGCAC TGAAAGAACAGCAGCAT GGTGCATTGT 

|]| CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
fT AACCAGAATGCATATAC TACAACT GCTAT TAATGGGACAGAT T TCT G TACATCAGCAAAAGATGC 

'SHI .r 

Hi AT TCAAAATC TTG T CCAAGAACT CAAGTGAC T TTACATCTAT TAACTGC T T TGGAGACT TCATAA 

fel TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
;r:?r AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
TGGAAACAAAT GATGGATCGTCAGAAAAGCCCTACT TTATGGATCAAGAAT T T C TGAGTT TCGTA 
AAAAGGAGCAACAAAT TAAACAATGCAAGGGCACAGCAGGACAAGCACTCAT TAAGGAAT GAGGA 
GGGAACAGAAC TCCAGGCCAT TGTGAG ATAGA TACCCATT TAGGTATCTGTACC TGGAAAACAT T 
T CC T TC TAAGAGCCAT T TACAGAAT AGAAGAT GAGACCACTAGAGAAAAGT TAGTGAATT T T T T T 
T TAAAAGACC TAATAAACCC TAT TC TTCC TCAAAA 
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FIGURE 32 

MSGRDTILGLCILALALSLAI^ 

LDTEREl^CVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFLLFQPLWTFA 
ILIFFWVLWAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA 
GAWTC YFNRSKNDPPDHP ILSSLSIL FFYHQGT WKGS FL I SWRI PR 1 1 VMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI I FLGKVLWCFTVFGGLMAFOT3S1RAFQWAVPLLLVAFFAYLVAHS FLSVFETVLDALFLCFA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

Important features: 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains: 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 

N-myristoylation sites, 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 33 



GTT CGAT TAGCT CC TC TGAGAAGAAGAGAAAAGGT TCT T GGAC CT C TCCC TGT T TCT T CC T TAGA 
ATAAT TT GTATGGGAT T TGT GAT GCAGGAAAGCC TAAGGGAAAAAGAATATT CATT C TGT GTGGT 
GAAAATTTTTTGAAAAAAAAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATTTATGAGGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
AT T CAAACAAAGAAACGGCAAAGAAGAT TAAAAGGCCCAAGT T CAC TG TGCC TCAGATCAAC TGC 
GAT GTCAAAGCCGGAAAGATCAT CGAT CC TGAGTT CATTGT GAAAT GT CCAGCAGGATGC CAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTTATCCCTACCACGATGGAGAGAATCCTT 
TAT CGTC TTAGAAAGTAAACCCAAAAAGGGTGTAACCTACCCATCAGCTC T TACATACTCATCAT 
%a , e CGAAAAGTCCAGCTGCCCAAGCAGGTGAGACCACAAAAGCCTATCAGAGGCCACCTATTCCAGGG 
p[ ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
CAC C T T GCCAAG GC CAT C CCC T TCT GC T GC T T CT AC CACCAGCAT C CC CAGAC CACAAT CAG T GG 
O GCCACAGGAGCCAGGAGATGGATCTCTGGTCCACTGCCACCTACACAAGCAGCCAAAACAGGCCC 
y! AGAGCTGAT CCAGGTAT CCAAAGGCAAGATCCTTCAGGAGC T GCC T T CCAGAAACCT GT T GGAGC 

Is j GGATGTCAGCCTGGGACTTGTTCCAAAAGAAGAATTGAGCACACAGTCTTTGGAGCCAGTATCCC 
TGGGAGATCCAAACTGCAAAATTGACTTGTCGTTTTTAATTGATGGGAGCACCAGCATTGGCAAA 
J' 4 CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
f 8 * CGGT CCAC TGATGGGTGT TGTCCAGTAT GGAGACAACCCTGC TAC TCAC T TTAACCT CAAGACAC 

|f! ACACGAAT TC T CGAGATC TGAAGACAGCCATAGAGAAAAT TACTCAGAGAGGAGGACTT TC TAAT 

B GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAT^AGCCAATGGAAACAGAAGCGG 
^ GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
^ GACTTGCGAGAGAGTCAGGAATCAACATTTTCTTCATCACCATTGAAGGTGCTGCTGAAAATGAG 
Wl AAGCAGTAT GT GGT GGAGCCCAAC T T TGCAAACAAGGC CGTGT GCAGAACAAACGGCT T C TAC T C 

Q GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
fh ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
f~r H GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
Jjf AGAGTT TGAGAT T T CCGACACGGACACGCGCATCGGGGCCGT GCAGTACACC TACGAACAGCGGC 

M" TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 
TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
GTCCAAGCCCAACAAGAGGAAGTTAATGATCCTCATCACCGACGGGAGGTCCTACGACGACGTCC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGCT AGAAGTCAT T GCCAC TCACCCCGCCAGAGACCACTCC T T CTT TG TGGACGA 
GTTTGACAACCTCCATCAGTATGTCCCCAGGATCATCCAGAACAT TTGTACAGAGT TCAAC TCAC 
AGCCTCGGAAC TGAA T T CAGAGCAGGCAGAGCACCAGCAAGTGCT GCT T TACTAACTGACG TGT T 
GGAC CAC CC CAC C G CT TAAT GGG G CAC G C ACG GT GCAT C AAG T C T T G GGCAG GG CAT G GAGAAAC 
AAAT GT C TTGT TAT TATTCT T T GCCAT CATGCT T T T TCATATT CCAAAAC T T GGAGT T ACAAAGA 
TGAT CACAAAC GTATAGAAT GAGCCAAAAGGCTACATCATGT T GAGGGTGC T GGAGAT T T TACAT 
T T T GACAATT GT T T TCAAAATAAAT GT TCGGAATACAGTGCAGCC C T TACGACAGGC T TACGTAG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
TTT TGTCATGACAATGTAGGAATTGCTGAAT TAAATGTT TAGAAGGATGAAAAATAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAkAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 34 

MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYAS YS S VCGAAVHS GVLDNSGGKI LVRKVAGQSGYKGS YSNGVQS LS LPRWR 
ESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RSGAPNVVVVMVDGWPTDKVEEASRLARESGINI FFIT IEGAAENEKQYVVEPNFANKAVCRTNG 
yi FYSLHVQSWFGLHKTLQPLVKRVCDTDRIiACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
CI LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
W FKKSKPNKRKLMI LITDGRS YDDVRI PAMAAHLKGVI TYAIGVAWAAQEELEVIATHPARDHS FF 

f \ VDE FDNLHQYVPRI IQNICTEFNSQPRN 

Si ' 

f4 Important features: 

If? Signal peptide: 

!L amino acids 1-26 

II! 

•P! Transmembrane domain: 

III amino acids 181-200 

fi s- 

*** N-glycosylation sites. 

amino acids 390-394, 520-524 



N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
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FIGURE as 



CC GAGCAC AGGAGATT GCCT GCGTTT AGGAGGTGGCTGCGTT GT GGGAAAAGCT AT CAAGGAAGAAATTGC 
CAAACCAT GT CTT T TT TT CT GT TT TC AG AGTAGT TC ACAACAGATCTG AGTGTTTT AATT AAGC AT GGAAT 
AC AGAAAACAACAAAAAACT TAAGCT TT AATT TC AT CT GGAATT CC AC AGTT TT CT T AGCTCCC TGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GG AT GTCACT GAGATC CCTC AAATGGAGCCT C CT GCT GCTGT CACT CCT G AG TTTCTTTGTGAT GTGGT AC 
CTCAGCCTTCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTTA 
CAGACAAGACTTTC ACT TCACACTTCGAGAGC ATTCAAACTGCT CT CAT C AAAATC CATT TCTGGT CATT C 
gw* TGGT GAC C TCCCACCCT TC AGATGTGAAAGCC AGGC AGGCCATT AGAGT T AC TTGGGGTG AAAAAAAGTCT 

f" % TGGTGGGGAT ATGAGGT TCT TACATT TT TCT T ATTAGGCC AAGAGGCT GAAAAGGAAGAC AAAATGT TGGC 



AT TGTCCT TAGAGG AT GAAC ACCTTCTT T AT GGT GAC ATAAT CC GACAAGAT TTTT TAGACACAT AT AAT A 
hi ACCT GACC TT GAAAACC AT TAT GGCATT CAGGTGGGT AACTGAG TTTTGCCC CAAT GCCAAGTACGT AAT G 

M AAGACAGACACT GATGTTT T C ATC AATACTGGCAATT T AGTGAAGT AT CT T T TAAACCTAAACC ACT C AGA 

GAAGTTT T TCACAGGT TAT C CT CTAATTGAT AATTATT CCT AT AGAGGATTT T AC C AAAAAACCCAT ATT T 

ft I 

CT TACCAGGAGT AT CCTTT C AAGGTGTT CCCT CC AT ACTGCAGT GGGT TGGGT T AT AT AATGTCCAGAGAT 
JL,. TT GGTGCC AAGGAT CT ATGAAATG AT GGGTCACGTAAAACCC AT CAAGTTTG AAGATGTT T AT GTCGGGAT 

|2 CTGTTTGAATTTATTAAAAGTGAACATTCATATTCCAGAAGACACAAATCTTTTCTTTCTATATAGAATCC 

ATTT GGAT GT CTGTCAACT G AG ACGT GT GATTGC AGCCCATGGC TT TT CTTCC AAGGAGAT CAT CACTTT T 

Iras' 

fjff TGGCAGGT CATGCT AAGGAACAC C AC AT GCCATT AT TAACTT CACATT CT AC AAAAAGCCT AGAAGGACAG 

f : ! GATACCTTGTGGAAAGTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTT 
flf AC ACTGAACTGAAACT CATGAAAAAC CCAGACTGGAGACT GGAGGGTT AC ACT T GT GATT TATT AGTC AGG 

CCCTTCAAAGATGATATGTGGAGGAATTAAATATAAAGGAATTGGAGGTTTTTGCTAAAGAAATTAATAGG 
AC CAAACAAT T TGGAC AT GT CATTCT GT AGAC TAGAAT TT CT T AAAAGGGT GTT ACTGAGTT AT AAGCTC A 
CT AGGCT GTAAAAACAAAAC AATGT AGAGTTT TATTT ATT GAAC AATGT AGT CACT TGAAGGTT TT GT GT A 
T ATCTT AT GT GGAT TACC AATT TAAAAAT AT ATGTAGT TCTGT GTCAAAAAACT TC TT CACTGAAGTT AT A 
CT GAACAAAAT TTT ACCT GTTTT TGGTC ATT T AT AAAGTACT T C AAGAT GTT GCAGT ATT TC AC AGTT AT T 
ATTATTTAAAATTACTTCAACTTTGTGTTTTTAAATGTTTTGACGATTTCAATACAAGATAAAAAGGATAG 
T GAATCAT TCT TTACATGC AAAC AT T TTCCAGTT ACTT AACT GATCAGTT TATTAT TGAT AC AT CACT CCA 
TTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAATATTTTACTGTGGT 
AATATAGAGAAGAATTAAAGCAAGAAAATCTGAAAA 
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FIGURE 36 



MMALWTVLPSRMSLRSLKWSLLLLSLLSF.FVMWYLSLPHYNVIERVNWMYFYEY 

TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 

LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 

NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 

KP I KFEDVYVGI CLNLLKVNIH I PEDTNL FFLYRI HLDVCQLRRVI AAHGFS SKE 1 1 T FWQVMLR 

NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site, 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGCTCGGGCACCAGC CGCGGCAAGGAT GGAGC TGGGT TGC TGGAC GCAGTT GGGGC T CAC TTT TC TTCAGC TCC TTC TCATC 
T CGTCCT TGCCAAGAGAGTACACAGTCAT TAATGAAGCC TGCCC TGGAGGkGAGTGGAATATCATGTGTCGGGAGTGCTGTG 
AATATGATCAGATTGAGTGCGTCTGCCCCGGAAAGAGGGAAGTCGTGGGTTATACCATCCCTTGCTGCAGGAATGAGGAGAA 
TGAGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAAACTGCAAGAGCTGCCGAAATGGCTCATGGGGGGGT 
ACCTTGGATGAC TTCTATGTGAAGGGGT TCTAC TGT GCAGAGTGCCGAGCAGGCTGGTACGGAGGAGAC TGCATGCGAT GTG 
GCCAGGT TC TGCGAGCCCCAAAGGGTCAGATT TTGT T GGAAAGC TATC CCC TAAATGCTCACT GTGAAT GGACCAT TCATGC 
TAAACCTGGGT TTGTCATCCAAC TAAGAT TTGTCAT GTTGAGTC TGGAGT T TGAC TACATGTGC CAGTATGACTATGTT GAG 
GT TCGTGATGGAGACAACCGCGAT GGCCAGATCATC AAGCGTGTCTGT GGCAACGAGCGGCCAGC TCCTATCCAGAGCATAG 
GATCCTCACTCCACGTCCTCTTCCACTCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGC 
[«,], ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
I s ! TATACTGGGCAGCGC TGTGAAAATC TC C T TGAAGAAAGAAACTGC TCAGAC CC TGGGGGC C CAGT CAATGGGTACCAGAAAA 

'•fcSfr 

f a l TAACAGGGGGCCCTGGGCTTATCAACGGACGCCATGC TAAAAT T GGCAC CGT GGT GTCTT TCT T T TGTAACAAC TCC TATGT 

g\ TC TTAGTGGCAATGAGAAAAGAAC TTGCCAGCAGAAT GGAGAGT GGTC AGGGAAACAGC C CAT C T GCATAAAAGCC TGC CGA 

I I GAACCAAAGAT TTCAGACCTGGT GAGAAGGAGAGT TC TTCCGATGCAGGTTC AGTCAAGGGAGACACCATTACACCAGC TAT 

Wit 

\J ACTCAGCGGCC TTCAGCAAGCAGAAAC TGCAGAGTGC CCCTACCAAGAAGCCAGC CC TT C CCT T T GGAGATCTGCCCATGGG 

5 I' ATAC CAAC ATC TGCATACCCAGC TC CAGTATGAGTGC ATCTCAC C CTTCTAC CGC C GCCTGGGCAGCAGCAGGAGGACAT GT 

m C TGAGGACTGGGAAGTGGAGTGGGCGGGCACC ATCC TGCATCCCT ATCTGCGGGAAAATT GAGAACATCACTGCTCCAAAGA 

CC CAAGGGTTGCGCTGGCCGTGGC AGGCAGCC ATCTACAGGAGGACCAGCGGGGTGCATGACGGCAGC C TAC ACAAGGGAGC 

^ gtggttcctagtctgcagcggtgccctggtgaatgagcgcactgtggtggtggctgcccactgtgttactgacctggggaag 
jfs gtcaccatgatcaagacagcagacc tgaaagttgt t t tggggaaattctac cgggatgat gac c gggatgagaagaccatcc 

agagcctacagatttctgctatcattctgcatcccaactatgaccccatcctgcttgatgctgacatcgccatcctgaagct 
cctagacaaggcccgtatcagcacccgagtccagccc atctgc ctcgctgc c agt c gggatct cagcac ttcc ttccaggag 
tcccacat cac tgtggctggc tggaat gtcctggcagacgtgaggagc cctggcttcaagaac gacacactgcgctc tgggg 
tggt cagtgtgg tggac tcgc tgct gt g tgaggagcagcatgaggacc atggcat c ccagtgagt gtcactgataacatgt t 
ctgtgccagctgggaacccactgccccttctgatatctgcactgcagagacaggaggcatcgcggctgtgtccttcccggga 
cgagcatc t cc tgagcc acgc tggc atc tgatgggac tggtcagctggagc tatgataaaacat gcagccacaggc tctcca 
ctgccttcaccaaggtgctgccttttaaagactggattgaaagaaat^^ 

tgtttctgtatatccgtctgtacgtgtgtcattgcgtgaagcagtgtgggcctgaagtgtgatttggcctgtgaacttggct 
gtgccagggcttctgacttcagggac^aaactcagtgaagggtgagtagacctccattgctggtaggctgatgccgcgtcca 
c tac taggacagccaattggaagat gcc agggcttgcaagaagtaagtt tc t tcaaagaagac catatacaaaacc tc tcc a 
c tc cac tgacc tggtggtct tcc ccaact ttcagttatacgaatgccat cagc t tgaccagggaagatc tgggctt catgag 

GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGCTGGGATGTGGTGCATGCCTT 
TGTGTACATGGCC AC AGTAC AGT CTGGTCCTTTTCC T TCCCCATCTCT T GTACACAT T T TAATAAAATAAGGGTTGGCTTC T 
GAAC TAC AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 3 8 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQILLESYPLNAHCEWTIHAKPGFV"IQLRFVMLSLEFDYMCQYDYVEVRDGDNRDGQII 
KRVCGNERPAP I QS I GS S LHVL FHSDGS KNFDGFHAI YEE I TACS S S PC FHDGTCVLDKAGS YKC 
ACLAGYTGQRCENLLEERNC SDPGGPVNGYQKI TGGPGL INGRRAKI GT WS FFCNNS YVXjSGNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KT QGLRWPWQAAI YRRTSGVHDGS LHKGAWFLVCS GALYNERTVVVAAHCVTDLGKVTMIKTADL 
KWLGKFYRDDDRDEKT I QSLQI S AI I LHPNYDP I LLDAD I AI LKLLDKARI S TRVQP ICLAASR 
DLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSGWSWDSLLCEEQHEDHGIPVSVTDNMFCA 
SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature, 
amino acids 260-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



HI 



GGTT CCTACAT CCT CT CAT CT GAGAAT CAGAGAGCATAAT CTTCTTACGGGC CC GT GATT TATTAACGTGGCTT AAT C 
T GAAGGTTCT CAGTCAAATT CTTT GT GAT CTACT GATT GT GGGGGCATGGCAAGGT TTGCTTAAAG GAGC TT GGCT GG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAGAATGAAGGCGCTTCTGTTGC 
T GGT CTTGCCTT GGCTCAGT CCTGCTAACTACAT T GACAAT GTGGGCAACCT GCACTTC CT GTAT TCAGAACTCT GTA 
AAGGTGCCT CC CACTACGGCCT GAC CAAAGATAGGAAGAGGC GCT CACAAGATGG CT GT CCAGACGGCT GT GCGAGC C 
T CACAGC CACGGCT CCCT C CCCAGAG GTTT CT GCAGCTGCCACCAT CTCCTTAAT GACAGACGAGC CTGGCCTAGACA 
ACCCTGC CTACGTGT C CTC GGCAGAGGACGGGCAGCC7VGCAATCAGCCCAGTGGACTCT GGCCGGAGCAACCGAACTA 
GGGCACGGC CCTTT GAGAGAT CCACTATTAGAAGCAGAT CATTTAAAAAAATAAAT CGAGCTTTGAGTGTTCTT CGAA 
GGACAAAGAGCGGGAGT GCAGTTGCCAACCAT GCCGACCAG GGCAGGGAAAATTCT GAAAACACCACTGCCCCTGAAG 
T CTTT CCAAGGTTGTACCACCTGATT CCAGAT GGTGAAATTACCAGCATCAAGAT CAAT CGAGTAGATC CCAGT GAAA 
GCCT CTCTATTAGGCT GGT GGGAGGTAGCGAAACCCCACT G GTCCATAT CATTAT CCAACACATTTAT CGT GATGGGG 
T GAT CGC CAGAGAC GGCCGGCTACTGCCAGGAGACAT CATT CTAAAGGTCAACGGGATGGACATCAGCAAT GTCCCT C 
ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 
GCAGGAACAATGGACAGGCCCCGGAT GCCTACAGACC CCGAGAT GACAGCT TTCAT GTGATT CT CAACAAAAGTAGCC 
CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 
GT GTGGCATATCGACAT GGT CAGCTT GAGGAGAAT GACCGT GTGTTAGCCATCAAT GGACAT GAT CT TCGATAT GGCA 
GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 
GGAGCC CTGACATCTTT CAGGAAGCC GGCT GGAACAGCAAT GGCAGCTG GT CCC CAGGGCCAGGGGAGAGGAGCAACA 
CT CC CAAGCC C CTCCATCCTACAATTACT T GT CAT GAGAAGGTGGTAAATAT CCAAAAAGACCCC GGT GAAT CT CTCG 
GCAT GACCGT C GCAGGG GGAGCATCACATAGAGAAT GGGATT TGC CTAT CTATGT CATCAGT GTT GAGCCCGGAGGAG 
T CATAAGCAGAGAT GGAAGAATAAAAACAGGT GACATTTT GT TGAAT GT GGATGGGGTCGAACT GACAGAGGT CAGCC 
GGAGT GAGGCAGTGGCATTATT GAAAAGAACATCATC CTCGATAGTACT CAAAG CTTTGGAAGT CAAAGAGTATGAGC 
CCCAGGAAGACT GCAGCAGCCCAGCAGCCCT GGACTC CAAC CACAACAT GGC CC CACCCAGT GACTGGT CCC CAT CCT 
GGGT CAT GT GGCTGGAATTACCACG GT GCTTGTAT AACTGTAAAGATATT GTATTACGAAGAAACACAGCTG GAAGT C 
TG GGCTTCT GCATT GTAGGAGGTTAT GAAGAATACAAT GGAAACAAACCTT TTTT CAT CAAAT CCATT GTT GAAGGAA 
CACCAGCATACAAT GAT GGAAGAATTAGAT GTGGT GATATT CTT CTTGCT GTCAAT GGTAGAAGTACAT CAGGAAT GA 
TACAT GCTT GCTTGGCAAGACT GCTGAAAGAACT TAAAGGAAGAATTACTCTAACTATT GTT TCT T GGCCTGGCACT T 
TTTT ATAGA AT CAAT GATGGGT CAGAGGAAAACAGAAAAATCACAAATAG GCTAAGAAGTT GAAACACTAT ATTTAT C 
TTGT CAGTTT TTATATTTAAAGAAAGAATACATT GTAAAAAT GT CAGGAAAAGTATGAT CAT CTAAT GAAAGCCAGT T 
ACACCT CAGAAAATAT GAT T CCAAAAAAATTAAAACTACTAGTTTTT TTT CAGT GT GGAGGATTT CT CATTACT CTAC 
AACATT GTTTATATT TTTTCTATT CAATAAAAAGCCCTAAAACAACTAAAAT GATT GATTT GTATACCCCACT GAATT 
CAAGCTGATTTAAATTTAAAATTTGGTATATGCTGAAGTCTGCCAAGGGTACATTATGGCCATTTTTAATTTACAGCT 
AAAATAT TTT TTAAAAT GCATTGCT GAGAAACGTTGCTT T CATCAAACAAGAATAAATATTTTT CAGAAGTTAAA 
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FIGURE AO 



MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRSFKKINR 
ALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV 
GGSETPLVHI IIQHIYRDGVIARDGRLLPGDI ILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDS FHVILNKS SPEEQLGIKLVRKVDEPGVFI FNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLVVSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 
amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 



ACCAGGCATTGTATCT T CAGTTGT CAT CAAGTTCGCAAT CAGAT TGGAAAAGCTCAACT TGAAGCTTT 
CTT GCCTGCAGT GAAGCAGAGAGATAGATAT TATT CACGTAATAAAAAACATGGGCTTCAACCTGAC T 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGACCCTCAT TTT GGGGAAGGGAAAAACTCTGACTAATGAAGCAT CCACGAAGAAGGTAGAACT T 
GACAACTGTCCTTCTGTGTCTCCTTACCTCAGAGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCAC 
TTTGGAAGAGGTACAGGCAGAAAATCCCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAG 
CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
III CAT CT GCAT CCCT TCCTGCAGAGGCAGCAGCTGGATTAT GGCATCTACGTCATCCACCAGGCTGAAGG 

pi TAAAAAGTTTAATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTC^^ 
fj| ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 
||| CATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGATATTTTGG 
\1 GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 

H GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 

If* 

CCTGAAGTGGGTAAATATACAATGGTCTTCCACACTAGAGACAAAGGCAATGAGGTGAACGCAGAACG 
|U GATGAAGCTCTTACACCAAGTGTCACGAGTCTGGAGAACAGATGGGTTGAGTAGTTGTTCTTATAAAT 
I'll TAGTATCTGTGGAACACAATCCTTTATATATCAACATCACAGTGGATTTCTGGTTTGGTGCATGACCC 
*f| TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 

pl atagtaggacacattaagaacctgttacagctcattgttgagctgaattt^^ 
w tagcagagctcctggtgatgtagagtataaaacagttgtaacaagacagctttcttagtcattttgat 
ft? cat gagggt taaatat tgtaatatggatact tgaaggactt tatataaaaggatgactcaaaggataa 

aatgaacgctatttgaggactctggttgaaggagatttatttaaatttgaagtaatatattatgggat 
aaaaggccacaggaaataagactgctgaatgtctgagagaaccagagttgttctcgtccaaggtagaa 
aggtacgaagatacaatactgttattcatttatcctgtacaatcatctgtgaagtggtggtgtcaggt 
gagaaggcgtccacaaaagaggggagaaaaggcgacgaatcaggacacagtgaacttgggaatgaaga 
ggtagcaggagggtggagtgtcggctgcaaaggcagcagtagctgagctggttgcaggtgctgatagc 
cttcaggggaggacctgcccaggtatgccttccagtgatgcccaccagagaatacattctctattagt 
ttttaaagagtttttgtaaaatgattttgtacaagtaggatatgaattagcagtttacaagtttacat 
attaactaataataaatatgtctatcaaatacctctgtagtaaaatgtgaaaaagcaaaa 
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FIGURE 42 

MGFNLT FHLS YKFRLLLLLTLCLT WGWAT SNYFVGAI QE I PKAKE FMANFHKTL I LGKGKTLTN 
EASTPCKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKZ^LQRVAILVPH 
RNREKHMYLLEHLHPFLQRQQLDYGIWIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
SVEHNPLYINITVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE 43 

GCTCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATGGCACTGAGCTCCCAGATCTGGG 
CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 
ACGGGACAAC T T GCAGAGC TGCAACCCCAGGACAGAGC TGGAGCCAGGGCCAGCT GGATGCCCAT 
GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 
ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 
TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 44 

MALSSQIWAACLLLLLLIASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
CI FCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide; 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myristoylation site, 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 
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FIGURE 



GT GGC T TCAT T T CAGT GGCTGACTT CCAGAGAGCAATATGGC T GGT TCCCCAACAT GCC TCACCC 
TCATCTATATCCTTTGGCAGCTGACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACC T T CAACACAACCCC TC T TGTCACCATACAGCCAGAAGGGGGCAC TATCATAGT GACCCAAA 
ATCGTAATAGGGAGAGAGTAGAC T T CCCAGAT GGAGGC TACT CCCT GAAGCTCAGCAAAC TGAAG 
AAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAATGGCACC TGTGTGACCAATC TGACATGCT GCATGGAACATGGGGAAGAGGATGTGAT T 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
CTGGAGATGGGGAGAAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACT 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
GTTTCTGAAGAGAGAGAGACAAGAAGAGTACATTGAAGAGAAGAAGAGAGTGGACATTTGTCGGG 
£f| AAACTCC TAACATATGCCCCCAT TCTGGAGAGAACACAGAG TACGACACAATCCCT CACACTAAT 

» AGAACAATCCTAAAGGAAGATCCAGCAAATACGGTTTACTCCAGTGTGGAAATACCGAAAAAGAT 
GGAAAATCCCCAC T CACTGC TCACGAT GCCAGACACACCAAGGCTATT TGCC TATGAGAATGT TA 
%! TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 

a 



m 

5,5' 

""I 
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FIGURE 46 



MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVT FPLKSKVKQVDSIVWTFNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 
VARNPVS RNFSSPI LARKLCEGAADDPDS SMVLLCLLLVPLLLSLFVLGL FLWFLKRERQEE Y I E 
EKKRVDICRETPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIPKKMENPHSLLTMPDT 
PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 



GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
AT AGGAGCAG GT C T GAT GGC CAT T CCAGCAACAACAAT G T C C T T GACAG C AAGAAAAAGAGC G T G 
CT GCAACAACAGAACT GGAATGT T T CT T TCAT CAT TT TTCAGT GT GAT CACAGTCAT T GGTGC TC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAATTG T GAAT T T TCAT TGAAAAACATCAGTGACATTCATCCAGAATCC T TCAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACACCAT GGCGAGT GGC TGGAGAGCATC TAGT T TCCAC T TCGAT T C TGAAGAAAACAAACATAGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
TGTAGTT TAATGGGAATAAAATGTAAGTATCAGTAGT TTGAAAAAAAAAAA 

il 
« 

m 



fll 
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FIGURE 48 



MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 
TTMSLTARKRACCNNRTGMFLSSFFSVITVIGALYCMLISIQALLKGPLMCNSPSNSNANCEFSL 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, . 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites . 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE 4Q 



ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAACC TGCAGAGTCACACAGTT GCT GACCAAT T GAGC TG T GAGC CT GGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
C ATGG AGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CAT T TGCCAATAAAGACGATCCC T TCTACTAT GACTGGAAAAACC T GCAGC TGAGCGGAC TGAT C 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
C TACT T GC TGAG CACAGGAC T GG C C T C CAGGGAT GG C C T GAAGC C T AACAC T GGCC CCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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FIGURE so 

MERVTLALLLLAGLTALEANDPFANKDDPFYYDWfCNLQLSGLICGGLLAIAGIAAVLSGKCKYKS 
SQKQHS PVPEKAI PLI TPGSATTC 

Important features: 
Signal peptide: 

amino acids 1-16 



Transmembrane domain: 

amino acids 36-59 



l t l N-myristoylation sites. 

amino acids 41-47, 45-51, 84-90 



Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE si 



w 

N. 

E : 



GTGGAC TC TGAGAAGCCCAGGCAGT TGAGGACAGGAGAGAGAAGGCT GCAGACCCAGAGGGAGGG 
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 
AGGAGACCC T GGT GGGAGGAAGACACT CTGGAGAGAGAGGGGGCT GGGCAGAGATGAAGT TCCAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATAT TGGGGAGGCCC TTGGACATGGCC TGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 
TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 
GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 
GAAACTCAGCAGGCAGCTTTGGAATGAATCCTCAGGGAGCTCCCTGGGGTCAAGGAGGCAATGGA 
GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 
ffl AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 
S ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 
Q AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 
Jff TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 
JJ* CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 
fV CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
I1J CT TCAGAGGACAGGGAGT T TCCAGCAACATGAGGGAAATAAGCAAAGAGGGCAATCGCC T CCT T G 

GAGGCTC TGGAGACAATTATCGGGGGCAAGGGTCGAGC TGGGGCAGTGGAGGAGGTGACGCTGT T 
GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAAT T T TAAATCCAAGC TGGGTT T CATCAAC TGGGATGCCATAAACAAGGACCAGAGAAGC TC T C 
GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 
AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 
AAAAAAAMAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAZU\AAAAAAAAAAAAAAA*U\AAAAAAAAAAAAAAAAA^ 
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FIGURE 52 

MKFQGPIACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNT GHE I GRQAEDVI RHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGM 
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 



m 

h" 

5 ^ 



Signal peptide: 

amino acids 1-21 

N-glycosylation site. 

amino acids 265-269 

Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 



11! 

&i Casein kinase II phosphorylation site. 



amino acids 26-30, 109-113, 259-263, 300-304, 304-308 



N-myristoylation si te . 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE sa 



GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATOTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCAC CCAAGGATAAT C TC TTCAT CAGGT TCCT GAAGCCC TGGCT GGGAGAAGGGATACTGC TG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
GAAGTCC TATATAACGAT CT TCAACAAGAGTGCAAACATCAT GCT T GACAAGTGGCAGCACCTGG 
CC T CAGAGGGCAGCAGTCGTCTGGACATGT TT GAGCACATCAGCC T CATGACC TT GGACAGTC TA 
CAGAAATGCATCT TCAGC TT TGACAGCCAT T GTCAGGAGAGGCCCAGTGAATATAT T GCCACCAT 
%j CT T GGAGCT CAGT GCCC T TGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACT T TC TGT 

L K ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
pi GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 



If! 



AGCCAAGTCCAAGACTTTGGATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGG 
CATTGTCAGATGAGGATATAAGAGCAGAGGCTGACACC TTCATGTT TGGAGGCCATGACACCACG 
GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
ACAGGAGGTGCAAGAGC T TC TGAAGGACCGC GATCC TAAAGAGAT T GAATGGGACGACC TGGCCC 
AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCAT CCACC TGT T T TT T TGCAGAT T GTCATGAATAAAACGGT GCTGTCAAA 
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FIGURE zj. 

MSLLSLPWLGLRPVAMSPWLLLLLWGSWLIARILAWT^ 

LIT PTEEGLKDSTQMSAT YSQGFTVWLGP 1 1 PFI VLCHPDT I RS I TNASAAI APKDNLFIRFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHIASEGSSRLDMFEHI 
SLMTLDS LQKC I FS FDSHCQERPSE YI AT I LELSALVEKRS QH I LQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTAS GLS WVL YNLARHPE YQE RCRQEVQE LLKDRDPKE I EWDDLAQLPFLTMCVKE S LR 
LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTWPDPEVYDPFRFDPENSKGRSP 
LAFI PFSAGPRNC I GQAFAMAEMKWLALMLLHFRFLPDHTE PRRKLEL IMRAEGGLWLRVEPLN 
VGLQ 

Important features: 
Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 461-471 

N-glycosylation sites. 

amino acids 112-116, 168-172 
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FIGURE .rr 



m 
pi 

if?? 



AT CGCATCAATT GGGAGTACCATCT TCCTCATGGGACCAGTGAAACAGCTGAAGCGAATGTTTGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAAT TCATGGCCAGT T T TATGAAGCT T TGGAAGGCAC TATGGACAGAAGCTGGT GGACAGT T T T 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GT GAT TCGAACAT TTGAGGGT TACT T T TGGAAGCAACAATACATT C TCGAACCTGAATGTCAGTA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
TC TGGATGT T GTCCCAC TGAATTCCCAT GAATACAAACC TATT CAGCAACAGCAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 56 



MGPVKQLKRMFE PTRL I AT IMVLLC FALTLCS AFWWHNKGLAL I FC I LQSLALT WYS LSFIPFAR 
DAVKKC FAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 

|*> Type II fibronectin collagen-binding domain protein. 

CI amino acids 30-72 



5 hi 
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FIGURE S7 



C GGCT CGAGCT CGAGC C GAATCGGCTC GAG GGGCAGT GGAGCACCCAGCAGGCCGCCAACATGCTCT GT CTGTG CCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
ATTTTCAAGCTCAGTGTCTTCATCCCCTCCCAGGAATTCTCCACCTACCGCCAGTGGAAGCAGAAAATTGTACAAGCT 
GGAGATAAGGACCTTGATGGGCAGCTAGACTTTGAAGAATTTGTCCATTATCTCCAAGATCATGAGAAGAAGCTGAGG 
CT GGTGTTTAAGATTTT GGACAAAAAGAAT GATGGAC GCAT T GACGCGCAGGAGAT CAT GCAGT CCCT GCGGGACTTG 
GGAGT CAAGATAT CTGAACAGCAGGCAGAAAAAATTCTCAAGAGCAT GGATAAAAACG GCAC GATGAC CATCGACTG G 
AACGAGT GGAGAGACTAC CACCT CCTCCAC CCC GTGGAAAACAT CCC C GAGAT CATCCT CTACT GGAAGCAT T CCACG 
AT CTT TGATGT GGGTGAGAATCT AACGGT CCC GGATGAGTTCACAGT GGAGGAGAGGCAGACGGGGAT GTGGTGGAGA 
CACCTGGT GGC AGGAGGT GGGGCAGGGGCCGTAT CCAGAACCT GCAC GGCCC CC CTGGACAGGCTCAAGGTGCT CAT G 
CAGGTCCATGC CT C CC GC AGCAACAACAT GGGCATCGTT GGT GGCTT CACTCAGATGATTCGAGAAGGAGGGGC CAGG 
T CACT CTGGCGGGGCAATGGCAT CAACGTCCT CAAAATTGCCCCCGAATCAGC CATCAAATT CATGGC CTAT GAGCAG 
ATCAAGCGCCTTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCAGGGGCC 
AT CGCC CAGAGCAGCATCTACCCAATGGAGGTC C TGAAG ACCCGGAT GGCGCT GCGGAAGACAGGCCAGTACT CAGGA 
tj, ATGCTGGACTGCGCCAGGAGGATCCTGGCCAGAGAGGGGGTGGCCGCCTTCTACAAAGGCTATGTCCCCAACATGCTG 
L-ij. GGCATCATCCCCTATGCCGGCATCGACCTTGCAGTCTACGAGACGCTCAAGAATGCCTGGCTGCAGCACTATGCAGTG 
M AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
f 1 CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
IS TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
l^l GCTGTGAGCATCAGCTACGTGGTCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGGTSACGGGGGGAGGGC 
Ijij CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
£1 AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 
GTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGG 
CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
l^ij, TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
■w 3 CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
g ATAATCCATGAT GAAAGGTGAGGTCACGT GGCCT CCCAGGCCT GACTT C CC AACCTACAGCATT GACGCCAACTTGGC 

g«t TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
y! GGCCTGGACCCTGTC^GGATGGGCCCCACCTCAGAACCAAACTCACTGTCCCCACTGTGGCATGAGGGCAGTGGAGCA 
*»$ CCATGTT TGAGGGCGAAGGGCAGAGCGTTT GT GT GTT CT GGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATT ATGG 

W ACTGTTGGGAAAAGGGTTTTGTCCAGAAGGACMGCCGGACAAATGAGCGACTTCTGTGCTTCCAGAGGAAGACGAGG 
||| GAGCAGGAGCTT GGCTGACTGCT CAGAGTCTGTTCTGACGCCCTGGGGGTTCCT GTC CAACCCCAGCAGGGGC GCAG C 

m| GGGACCAGCCCCACAT TC CACT T GTGT CACTGCTTGGAACCTATTTATTTTGTATTTATTTGAACAGAGT TATGTCCT 

W AACTATTTTTATAGATT TGTTTAATTAATAGCTT GTCATTTT CAAGTT CATTTTTTATTCATAT TTATGTTCATGGTT 

f |l GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 
GT CCAGAGAAATT CCT TTT GGGACT GGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCT GGCT C CTTTCCTTT GGCAG 
GTT GGGGAAGGGCT TGCC C CCAGCCTTAGGAT TTCAGGGTTTGACT GGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTT CT GCT GCCCTT GCTTAACAAT GCCGGC CAACTGGCGAC CT CACGGTTGCACTTCCATTCCACCAGAATGACCT GA 
T GAGGAAATCTTCAAT AGGATGCAAAGATCAAT GCAAAAATT GTTATATAT GAACATATAACT GGAGTCGTCAAAAAG 
CAAATTAAGAAAGAATTGGACGTTAGAAGTTGTGATTTAAAGCAGCCTTCTA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE s8 

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG 
TMT I DWNE WRD YHLLHPVENI PE 1 1 LYWKHS T I FDVGENLT VPDE FT VEERQTGMWWRHL VAGGG 
AGAVSRTCTAPLDRLKVIMQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK 
FMAYEQ IKRL VGS DQETLRIHERLVAGSLAGAI AQS S I YPME VLKTRMALRKTGQYS GMLDCARR 
I LARE GVAAFYKG YVPNMLG 1 1 PYAG I DLAVYE T LKNAWLQH YAVNS ADPGVFVLLACGTMS S TC 
GQLASYPLALVRTRMQAQAS IEGAPEVTMSSLFKHILRTEGAFGLYRGLAPNFMKVIPAVS ISYV 
VYENLKITLGVQSR 

Important features: 
Signal peptide : 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE 5Q 



GGAAGGCAGCGGCAGC TC CAC T CAGCCAGTACCCAGATACGCTGGGAAC CT TCCCCAGCCATGGC 
T T CCC TGGGGCAGATCC T CT TCT GGAGCATAAT TAGCATCATCAT TAT T CTGGCT GGAGCAAT T G 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACATTGGGGAGGATGGAATCCTGAGCT GCACTT T TGAACCT GACATCAAACT T TC T GATAT 
CGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCAAAGAAGGCAAAGATG 
AGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTTGCTGATCAAGTGATAGTT 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
CAT CACT TCTAAAGGCAAGGGGAAT GC TAACCT T GAGTATAAAAC T GGAGCCT TCAGCATGCCGG 
AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
CAGCCCACAGTGGT CTGGGCATCCCAAGT TGACCAGGGAGCCAACT TC TCGGAAGT C T CCAATAC 
CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
TCAACAACACATACTCCTGTATGATTGAAAATGACATTGCCAAAGCAACAGGGGATATCAAAGTG 
m ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT 
g CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAA ATAA T 
GT GCC T TGGCCACAAAAAAGCATGCAAAGTCATT GT TACAACAGGGATCTACAGAACTAT TTCAC 
CACC AGATAT GAC C TAG T T T TAT AT T TC TGGGAGGAAAT GAAT TCATATC TAGAAGT CTGGAGTG 
AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAAT 
CTATCTTCAAAGACATATTAGAAGTTGGGAAAATAATTCATGTGAACTAGACAAGTGTGTTAAGA 
GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACCTGGGGAGTGAGAGGACAGGATAGTGCAT GT T CT T TGTC T CT GAAT T TT TAGT T ATATGT GC 
TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TCT TCCCAAC TGACAAATGCCAAAGTTGAGAAAAATGATCATAATT TTAGCATAAACAGAGCAGT 
C GGG G AC AC C GAT T T T AT AAAT AAAC T GAG CAC CTTCTTTT T AAAC AAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



m 
b 



APP ID=10063716 



Page 213 of 322 



FIGURE 60 

MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 
YIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTWWASQVDQGANFSEVS 
NTSFELNSENVTMKWSVLYNVTINNTYSCMIENDIAKATGDIPCVTESEIKRRSHLQLLNSKASL 
CVS S FFAI S WALLPLS PYLMLK 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 258-281 

N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 
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FIGURE 61 

TGACGTCAGAAT CACCATGGC CAGCTATCC T TACCGGCAGGGC TGCCCAGGAGC TGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
i; GGT GGAC TCAGAT CACAGTGGCTATAT C TCCATGAAGGAGC TAAAGCAGGCCC T GGT CAAC T GCA 

p ATT GGT CTTCAT TCAATGATGAGACCT GCCTCAT GATGATAAACAT GTT TGACAAGACCAAGTCA 

O GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 

m 

f* CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 

J ; J AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 

GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAG 

g! GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 

.». CCATGACAGC TT C T CGGATGC TATGACCCAACCAT C TGT GGAGAGT GGAGTGCACCAGGGACC T T 

^ TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 

^ ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 

ft AATAGTGAGGACCGGGGCTGAGGCCACACAGATAGGGGCCTGATGGAGGAGAGGATAGAAGTTGA 

In;;' 

Q ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
III AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCGCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CT GACCAAGCAT GAGAGAGATCTGT CTATGGGACCAGT GGC T T GGATTCTGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCT.GCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
ATTTGGGGCCAAAAGTCCAGTGAAATTGTAAGCTTCAATAAAAGGATGAAACTCTGA 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 

HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 

SGYISMKELKQALWCNWSSFNDETCLMMINMFDKTKSGRIDVYGFSALWKFIQQWKNLFQQY 

DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 

DTAVQGNIRLSFEDFVTMTASRML 

Important features of the protein: 
Signal peptide: 
amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 6s 



CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 

jjaais 

p t TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
Q TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
|fl CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
W TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 

ATCTGGGGTAT CAATAATCAACC CC CAGGTACCAGC TGGGGAAATAT TAATCGGTAT CCAGGAGG 
$p CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 

GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 



CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
*M GGGC TAGAGCACGATAGAGGGAAAC CCAACAT TGGGAGT TAGAGT CCT GCTCCCGCCCCT TGC TG 

J? TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 

111 

f?| TGCTCATCTCCAATAAAATAAAAGCACTTATGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Hi AAAAAAAAAAMAAAAAAAAAAAAAAAAAA&AAAAAAAAAAAAAAAAAAAA 
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FIGURE 6a 



MQGRVAGSCAPLGLLLVCLHLPGLFARSIGVVEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Casein kinase II phosphorylation sites. 

amino acids 56-59, ■ 155-158 

N-myristoylation sites . 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 6 s 

AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 




m 

?, ir'r. 

; :; 
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FIGURE 66 



if-'- 



US 



J, 

SI 
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ru 



MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARS QHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 37-41 

N-myristoylation sites. 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 



ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 

In* 

Q GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
§1 GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
Wl AGACTC TGCCCCT GAGGTCAAGAGAGGAT GGGGCTATT CACT TT TATATAT TTATATAAAAT TAG 

a '*f TAGTGAGATGTAAAAAAAAAAAAAAAAAA 
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FIGURE 68 

MANPGLGLLLALGLPFLLARWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIWFS 
LLAALLLAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEAEIAPQDSKETVQGCLPI 

Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 3.6-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 86-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 

N-myristoylation sites. 

amino acids 7-13, 26-32 
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FIGURE 6Q 



GC CAG GAATAACTAGAGAGGAACA ATGG GGTTATT CAGAGGTTTT GTTTT C CT CT TAGT T CT GT GCCTGCT GCACCAG 
TCAAATACTTCCTTCATTAAGCTGAATAATAATGGCTTTGAAGATATTGTCATTGTTATAGATCCTAGTGTGCCAGAA 
GATGAAAAAATAATT GAACAAATAGAGGATAT GGT GACTACAGCT T CTACGTACCT GTTT GAAG C CACAGAAAAAAGA 
TT TTTT TTCAAAAAT GTATCTATATTAATT CCT GAGAATT GGAAGGAAAAT CCT CAGTACAAAAGGCCAAAACAT GAA 
AACCAT AAACAT GCT GATGT TATAGT T G CACCACCTACACT C CCAGGTAGAGAT GAACCAT ACACCAAGCAGTT CACA 
GAAT GT G GAGAGAAAGGCGAATACAT TCACTT CAC CCCT GAC CTT CT ACTT GGAAAAAAACAAAAT GAAT AT GGACCA 
C CAG GCAAACT GTTT GT CCAT GAGT GGGCTCACCT CCGGTGGGGAGT GT TT GAT GAGTACAATGAAGATCAGCCTTTC 
TACCGTGCTAAGTCAAAAAAAATCGAAGCAACAAGGTGTTCCGCAGGTATCTCTGGTAGAAATAGAGTTTATAAGTGT 
CAAGGAGGCAGCTGT CTTAGTAGAGCAT GCAGAATTGAT T CT ACAACAAAACTGTAT GGAAAAGATT GT CAATT CTTT 
CCTGATAAAGTACAAACAGAAAAAGCAT CCATAAT GTTTATG CAAAGTATT GATTCTGTT GTT GAATTTTGTAAC GAA 
AAAACCCATAATCAAGAAGCTCCAAGCCTACAAAAC7VTAAAGTGCAATTTTAGA 

TCTGAGGATTT TAAAAACACCATACCCAT GGTGACAC CACCT CCT CCACCTGT CTTCT CATTGCT GAAGATCAGT CAA 
f-* AGAATT GTGT GCTTAGTTCTT GATAAGT CT GGAAGCAT GGGGGGTAAGGAC CGCCT AAATCGAAT GAATCAAGCAGCA 

f B | AAACAT T TCCTGCT GCAGACT GTT GAAAAT GGATCCT GGGTGGGGAT GGTTCACTT T GATAGTACTGCCACTATT GTA 

AATAAG CTAATCCAAATAAAAAG CAGT GAT GAAAGAAACACACT CAT GGCAGGATTACCTACATATCCT CTGGGAGGA 
O ACTT CCATCT GCTCT GGAATTAAATAT GCATTT CAGGT GATT GGAGAGCTACATT CCCAACT CGATGGAT CCGAAGTA 

||| CT GCT GCTGACT GAT GGGGAGGATAACACT GCAAGTT CT TGTATT GATGAAGT GAAACAAAGT GGGGCCATT GT T CAT 

f 7 l TT TATT GCTT TGGGAAGAGCTGCT GATGAAGCAGTAATAGAGAT GAGCAAGATAACAGGAGGAAGTCATT TTTAT GTT 

III T CAGAT GAAGCT CAGAACAATGGCCTCATT GAT GCTTTT GGGGCTCT T ACATCAGGAAATACTGATCT CTCCCAGAAG 

§ T CCCTT CAGCT C GAAAGTAAGG GATTAACACT GAATAGTAATGCCTGGAT GAAC GACACT GTCATAATT GATAGTACA 

, '\ GTGGGAAAGGACACGTTCTTTCTCATCACATGGAACAGTCTGCCTCCCAGTATTTCTCTCTGGGATCCCAGTGGAACA 
W r < ATAAT GGAAAATTT CACAGT GGAT GCAACTTCCAAAATGGCCTAT CT CAGT ATT C CAGGAACT GCAAAGGTGGGCACT 

f II T GGGCATACAAT CTTCAAG CCAAAGCGAACCCAGAAACATTAACTATTACAGTAACTTCT CGAGCAGCAAATT CTTCT 

GTGCCT CCAATCACAGT GAATGCTAAAAT GAATAAGGACGTAAACAGTTT CCC CAGC CCAATGAT T GTTTACGCAGAA 
* ATTCTACAAGGATATGTACCTGTTCTTGGAGCCAATGTGACTGCTTTCATTGAATC1ACAGAATGGACATACAGAAGTT 
f| TTGGAACTTTTGGATAATGGTGCAGGCGCTGATTCTTTCAAGAATGATGGAGTCTACTCCAGGTATTTTACAGCATAT 
JJJ ACAGAAAATGGCAGATATAGCTTAAAAGTT CGGGCTCAT GGAGGAGCAAACACT GCCAGGCTAAAATTAC GGCCT CCA 

hi CTGAATAGAGC CGC GTACAT ACCAGGCT GGGTAGT GAAC GGGGAAATT GAAGCAAACCCGCCAAGAC CTGAAATT GAT 

Q GAGGATACTC^GACCACCTTGGAGGATTTCAGCCGAACAG^ 

J]f CTTCCCT TGCCT GACCAATACCCACCAAGT CAAAT CACAGACCTT GAT GC CACAGTT CAT GAGGATAAGATTATT CTT 

III ACATGGACAGCACCAGGAGATAATTTT GAT GTTGGAAAAGTT CAACGTTAT AT CATAAGAATAAGTGCAAGTATT CTT 

f -1 GATCTAAGAGACAGTTTT GATGAT GCT CTTCAAGTAAATACTAC TGAT CT GT CACCAAAGGAGGCCAACT CCAAGGAA 

J! agctttgcatttaaaccagaaaatatctcagaagaaaatgcaacccacatatttattgccattaaaagtatagataaa 

111 agcaatttgacat<^aaaagtatc(^cattgcacmgtaa^^ 

cctacacctactcctactcctactcctactcctgataaaagtcataattctggagttaatatttctacgctggtattg 
tctgtgattgggtctgttgtaattgttaactttattttaagtacc1acc7vtt tgaa ccttaacgaagaaaaaaatcttc 

AAGTAGACCTAGAAGAGAGTTTTAAAAAACAAAACAATGTAAGTAAAGGATATTTCTGAATCTTAAAATTCATCCCAT 
GT GTGAT CATAAACT CATAAAAATAATTTTAAGAT GT CGGAAAAGGATACTTT GATTAAATAAAAACACTCAT GGATA 
TGTAAAAACT GT CAAGATTAAAATTTAAT AGTTT CATT TATTT GTTATT TTAT TTGTAAGAAATAGT GAT GAACAAAG 
AT C CTTTTT CATACT GATAC CT GGTTGTATATTATTT GAT GCAACAGT T TTCTGAAAT GATATTTCAAATTGCAT CAA 
GAAATTAAAATCAT CTATCT GAGTAGT CAAAATACAAGTAAAGGAGAGCAAAT AAACAACATTT G GAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 7Q 



t r 



lit 

B 

p 

5 



MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGS CLSRACRI DS T TKL YGKDCQ FFPDKVQTEKAS IMFMQS I DS WE FCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWGMTOFDSTATITOKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVI GELHSQLDGSEVLLLTEXBEDNTAS SC I DEVKQS GAI VHFI ALGRAADEAVI EMSKI TGG 
SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 
I TWNSLPPS I SLWDPSGT IMENFTVDATSKMAYLS I PGTAKVGTWAYNLQAKAMPETLT ITVTSR 
AANSSVPPITVNAKMNKDVNSFPSEMI^ 

GADS FKNDGVYSRYFTAYTENGRYS LKVRAHGGANTARLKLRPPLNRAAYI PGWWNGE IEANPP 
RPEIDEDTQTTLEDFSRTASGGAFVVSQVPSLPL'PDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRYIIRISASILDLRDSFDDALQVNTTDLSPKEANSKESFAFKPENISEENATHIFIAI 
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VNFILSTTI 

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 



Leucine zipper pattern. 

amino acids 469-491, 476-498 



N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



CT CCT TAGGTGGAAACCCT GGGAGTAGAGTACT GACAGCAAAGACCG GGAAAGAC CATACGT CCCCGGGCAGGGGTGA 
C^CAGGTGTC^TCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGACGCCAAGGTAATTTTGACCCA 
GAGGAGCAAT GATGTAGCCACCT CCTAACCTTCC CTT CTT GAAC CCCCAGTTAT GCCAGGAT TTACTAGAGAGT GTCA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GTGCTCAGAAGTGACTTGCCTGAGGGTGGACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAA 
GGCT GT GAT GGGAAT GAAGGT GAAAACT T G GAGAT T T CACTT CAGTCAT TGCTTCTGCCT GCAAGAT CAT CCTTTAAA 
AGTAGAGAAGCT GCT CT GT GTGGTGGTTAACT CCAAGAGGCAGAACT CGTTCTAGAAGGAAAT GGAT GCAAGCAGCT C 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCAC 
CGCAACTACGTGAGCAGCCTGAAGCGGCAGATCGCACAGCTCAAGGAGGAGCTGCAGGAGAGGAGTGAGCAGCTCAGG 

fa AAT GGGCAGTACCAAGCCAGCGAT GCTGCT GGC CT GGGT CTGGACAGGAGCCCCC CAGAGAAAACCCAGGCC GACCT C 

p % CTGGCCTTCCTGCACTCGC^GGTGGACAAGGC^GAGGTGAATGCTGGCGTC^ 

W CCTT T CGATAGCTTTACT CTACAGAAGGT GTAC CAGCT GGAGACTGGCCTTACCC GCCACCCCGAGGAGAAGCCTGT G 

pi AGGAAGGACAAGCGGGATGAGTTGGTGGAAGCCATTGAATCAGCCTTGGAGACC^ 

X CCCAATCACC GT CCT TACACGGCCT CT GAT TTCATAGAAGGGAT CTACCGAACAGAAAGG GACAAAGGGACATTGTAT 

*f 5 GAG CTCACCTT CAAAGGGGACCACAAACACGAAT T CAAAC GGCT CAT CTTATTT CGACCATT CAGCCCCATCAT GAAA 

|i| GT GAAAAAT GAAAAGCTCAACATGGCCAACACGCTTAT CAAT GT TAT CGT GCCT CTAG CAAAAAGGGTGGACAAGTT C 

CGGCAGTTC ATGCAGAATT T CAGGGAGATGTGCATT GAGCAGGAT GGGAGAGT CCAT CT CACT GT TGTTTACTTT GGG 
AAAGAAGAAATAAATGAAGTCAAAGGAATACTTGAAAACACTTCCAAAGCTGCCAACTTCAGGAACTTTACCTTCATC 
CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 
TTTT TCT GTGAT GT GGACAT CTACTT CACATCT GAAT T CCTCAATACGT GTAGGCTGAATACACAGCCAGGGAAGAAG 
GTAT TTTATC CAGTT CTTT TCAGT CAGTACAAT CCTGGCATAATATACGGCCACCATGAT GCAGTCCCT CCCT T GGAA 
CAGCAGCTGGT CATAAAGAAGGAAACT GGATTTT GGAGAGACTTT GGATTTGGGATGACGTGT CAGTAT C GGT CAGAC 
r TT CAT CAATATAGGT GGGT TT GAT CTGGACAT CAAAGGCT GG GGCGGAGAGGAT GTG CACCTTTATCGCAAGTATCT C 

CACAGCAAC CTCATAGTGGTAC GGACGCCT GTGCGAGGACTCT T CCAC CT CTGGCATGAGAAGC GCT GCATGGACGAG 
CT GAC CCCC GAGCAGTACAAGATGT GCAT GCAGT C CAAGGCCAT GAAC GAGGCAT CC CACGGC CAGCT GGGCAT GCT G 
GT GTT CAGGCACGAGATAGAGGCT CACCTT CGCAAACA.GAAACAGAAGACAAGTAGCAAAAAAACATGAACT CCCAGA 
GAAGGAT T GT GGGAGACACTTTTT CTTT C CTTTTGCAATTACT GAAAGT GGCT GCAACAGAGAAAAGACTTC CATAAA 
GGACGACAAAAGAAT TGGACTGAT GGGT CAGAGAT GAGAAAGCCT CCGAT TT CT CTCT GTTGGGCTT TT TACAACAGA 
AATCAAAATCTCCGCTTTGCCTGCAAAAGTAACCCAGTTGCACCCTGTGAAGTGTCTGACAAAGGCAGAATGCTTGTG 
AGATTATAAGCCTAATGGTGTGGAGGTTTTGATGGTGTTTACAATACACTGAGACCTGTTGTTTTGTGTGCTCATTGA 
AATATTC^TGATTTAAGAGCAGTTTTGTAAAAAATTCATTAGCATGAAAGGCAAGCATATTTCTCCTCATATGAATGA 
GCCTATCAGCAGGGCTCTAGTTTCTAGGAATGCTAAAATATCAGAAGGCAGGAGAGGAGATAGGCTTATTATGATACT 
AGTGAGTACATTAAGTAAAATAAAATGGACCAGAAAAGAAAAGAAACCATAAATATCGTGTCATATTTTCCCCAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTTT TT CCCT TGT GAGTTATAGT CT GCTTATT TAATTACCACT TT GCAAGCCTTACAAGAGAGCACAAGTTGGCCTAC 
ATTT TTATATTTTTTAAGAAGATACTTT GAGAT GCATTAT GAGAACT TT CAGTT CAAAGCATCAAATT GATGCCATAT 
CCAAGGACAT GCCAAATGCT GATTCTGT CAGGCACTGAAT GT CAGGCATT GAGACATAGGGAAGGAAT GGTT T GTACT 
. AATACAGACGTACAGATACTTT CTCTGAAGAGTATTT T CGAAGAGGAGCAACT GAACACT GGAGGAAAAGAAAAT GAC 
ACTT T CT GCT TT ACAGAAAAGGAAACT CAT TCAGACT GGT GATAT CGTGATGTACCTAAAAGTCAGAAACCACATTTT 
CTCCTCAGAAGTAGGGACCGCTTTCTTACCTGTTTAAATAAACCAAAGTATACCGTGTGAACCAAACAATCTCTTTTC 
AAAACAGGGT GCTC CTCCT GGCTT CTGGCT TCCATAAGAAGAAAT GGAGAAAAATATATATATATATAT ATAT ATTGT 
GAAAGAT CAATCCAT CTGCCAGAAT CTAGTGGGAT GGAAGTTT TT GCTACAT GT TATC CACCC CAGGC CAGGT GGAAG 
TAACT GAATTAT TTT TTAAATTAAGCAGTTCTACT CAATCACCAAGAT GCTT CT GAAAATTGCAT TTTATTACCATTT 
CAAACTATTTTTTAAAAATAAATACAGTT7^ACATAGAGTGGTTTCTTCATTCATGTGAAAATTATTAGCCAGCACCAG 
ATGCATGAGCTAATTATCTCTTTGAGTCCTTGCTTCTGTTTGCTCACAGTAAACTC^TTGTTTAAAAGCTTCAAGAAC 
ATT CAAGCT GTT GGT GT GT TAAAAAAT GCATTGTATT GAT TT GTACT GGTAGTTTAT GAAATTTAATTAAAACACAGG 
CCATGAATGGAAGGTGGTATTGCACAGCTAATAAAATATGATTTGTGGATATGAA 



m 
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FIGURE 72 



MMVRRGLLAWI SRVWLLVLLCCAI S VLYMLACT PKGDEEQLALPRANSPTGKEGYQAVLQEWE 

EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 

AEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 

ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNMAN 

TL INVI VPLAKRVDKFRQ FMQNFREMC I EQDGRVHLT WYFGKEE INE VKG I LENTSKAANFRNF 

TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 

NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 

KYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKM^ 

RKQKQKT S S KKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE 7a 



GAGAC TGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGAT T TGT CCT GGGGATCCA 
GAAACCCAT GAT ACCCTACT GAACACCGAATCCC CT GGAAGC CCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAATGCCCAGT CGCCCATCGATAT TCAGACAGACAGT GTGACAT T TGACCC TGAT T TGCC 

U 

TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
m ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
||! GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 
III TGAAGCCACAT T TGCAGAGCTCCACAT T GTACAT TATGACT CTGAT TCC TATGACAGCT T GAGTG 

y AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
fa AATATAGCT TAT GAACACAT T CTGAGTCACT T GCATGAAGT CAGGCATAAAGATCAGAAGACCTC 

I AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
Q GCTCGC TCACAACT CCCCCT T GC TACCAGAGTGTGC TCT GGACAGT T T T TTATAGAAGGT CCCAG 

If! ATTTCAATGGAACAGCTGGAAAAGCTTCAGGGGACATTGTTCTCCACAGAAGAGGAGCCCTCTAA 
!*f GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
!S TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
fit TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGTGT GGTCT T CACCTCAGCACAAGCCACGAC TGAGGCATAAATTCCTT CTCAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCT CCT T CCCC TGGACATC TC T TAGAGAGGAAT 
GGACCCAGGCTGT CAT T CCAGGAAGAAC TGCAGAGCC T T CAGCCTC TC CAAACATGTAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
CTCCTTAGGATAAAGAGT TGCTGTTGAAGTTGTATATT TTTGATCAATATATT TGGAAATTAAAG 
TTTCTGACTTT 
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FIGURE 74 



MLFSALLLEVIWI LAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVT FDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
WFT S AQATTEA 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryo tic- type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 75 

TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGAATGTCCTACA 
AT GGACT CCAC CAGAGGGTC TTCAAGGAGT TAAAGTT ACTTACAC TG TGCAGTATT T CAT CACAA 
AT T G GC C CAC CAGAGG T G GC AC T GAC TAC AGAT GAGAAG T C CAT TTCTGTTGTCCT GACAGC T CC 
AGAGAAGTGGAAGAGAAATCCAGAAGACCTTCCTGTTTCCATGCAACAAATATACTCCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 

0 

pi: CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
m TC T GTGATGGGCTAT TCCAT C TACCGATATATCCACGT TGGCAAAGAGAAACACCCAGCAAAT T T 

%tp I- 

III GAT T TTGATT TAT GGAAAT GAAT TT GACAAAAGAT TCT TTGT GCC T GCTGAAAAAAT CGTGAT TA 

y ACT T TATCACCCT CAAT ATCT CGGATGAT TCTAAAATT TC TCATCAGGATATGAGTT TACTGGGA 

%l AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
GGAAGAGGAGGTGAAACATTTAGGGTATGCTTCGCATTTGATGGAAATTTTTTGTGACTCTGAAG 

s. ■ 

C| AAAACACGGAAGGTACT T CTC TCACCCAGCAAGAGT CCCT CAGCAGAACAATACCCCCGGATAAA 

||! ACAGTCAT TGAATATGAAT AT GATGTCAGAACCACT GACAT T T GTGC GGGGCC TGAAGAGCAGGA 

O GCT CAGT TTGCAGGAGGAGGT GTCCACACAAGGAACAT TAT T GGAGT CGCAGGCAGCGTT GGCAG 

JS( TCT T GGGCCCGCAAACGT TACAGTACTCATACACCCCT CAGCTCCAAGAC T TAGAC CCCC T GGCG 

2s CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAATGAAACC TATCT CAT GCAAT TCAT GGAGGAATGGGGGT T ATA 
TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 
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FIGURE 76 

MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYWSVLNTKSNRTWSQCVTN 

DQS S E FKAKI I FWYVLP I S I T VFLFSVMGYS I YRYI HVGKEKHPANL I L I YGNE FDKRFFVPAEK 
I VINFI TLNI SDDSKI SHQDMSLLGKS SDVS SLNDPQPS GNLRPPQEEEEYKHLGYASHLME I FC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 



GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAG ATG GCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCT GGC TGAAGGTCAT CACAGCTAACATCC T CCAGCT GCAGGTGAAGCCCTCGGCCAAT GACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
TC GT GGAG T T C CACAT GAC GAC T GAGG C C CAAG C CACC AT C CGC AT G GACAC C AG T GCAAGT GGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
W ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
T.! TGAT CAATGAAAAGGC T GCAGATAAGC TGGGATCTACCCAGATCGTGAAGATCCTAACTCAGGAC 

n ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
fill TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
TT TACACCAAAGG T GACCAACT TATAC TCAAC T TGAATAACAT CAGC TC TGATCGGAT CCAGC TG 
AT GAACT CT GGGAT TGGCTGG T TCCAACC TGATGT T CT GAAAAACATCATCAC T GAGAT CAT C CA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 



X-4S, 



h! 



ft 
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FIGURE 78 



til 

o 

r....~; 



MAGPWT FTLLCGLLAAT L I QATLS P TAVL I LGPKVI KEKLTQELKDHNAT S ILQQLPLLSAMREK 
PAGGI PVLGSLVNTVLKH 1 1 WLKVI TAN ILQLQVKPSANDQELLVKI PLDMVAGFNTPLVKT I VE 
FHMTTEAQATIBMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLTOALAKQVMNLLVPSLPNL 
VKNQLC P VI EAS FNGMYADLLQLVKVP I SLS I DRLE FDLL Y PAI KGDT I QL YLGAKLLDSQGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDWK^^ 

EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQLILNLNNISSDRIQLMNSGIGWFQPDVLKNIITEIIHSILLPNQNGKLRSGVPVSLVKALG 
FEAAESSLTKDALVLTPASLWKPSSPVSQ 



yi Important features of the protein: 

fsf 

'r"Z Signal peptide: 

3 !' amino acids 1-21 



N-glycosylation sites* 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 

GAGAGAAGT CAGCC T GGCAGAGAGACTC TGAAAT GAGGGAT TAGAGGTGTTCAAGGAGCAAGAGC 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCT CT GGAT GGAATGTGCCACACACAGCACAGGCAT CACCCAGTGT GACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
^ GAGCCAAA.GACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
P ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
01 CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTGTCCCTGATAG 
W. CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
* 4 CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
hi GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
g GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 
0 GTCAGAAGGT GCT GCT GAGGATAGAC T GAC TT T GGCCAT TGGAT T GAGCAAAGGCAGAAAT GGGG 

in 

GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
m GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
HI TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACTAATT TGTGCAT GAACT GAAATAAAACCATCCTACGG TAT CCAGGGAACAGAAAGCAGGATG 
CAGGATGGGAGGACAGGAAGGCAGCCT GGGACAT T TAAAAAAATA 
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FIGURE 80 



MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 
I YS TLLGLPAD I QAAQAMMVT S SAI S SLACI IS WGMRCT VFCQESRAKDRVAVAGGVFFI LGGL 
LGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQRNRSNTYY 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 



1 



Transmembrane domains: 



I 

:: 



amino acids 82-102, 117-140, 163-182 



N-glycosylation site. 



amino acids 190-193 



FMP-22 / EMP / MP20 family proteins. 



amino acids 4 6-59 
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FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 
AC T GCCAGACT CCT GCT GCCAAGCC TGCAAAGAT GAGGCAAGTGAGCAATCGGATGAAGAGGACA 
GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
CAGACCCAAGGGAGCAGGCAGCACAACTGTCAAGATCGTCCTGAAGGAGAAACATAAGAAAGCCT 
GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CTCGGACTTGGTGGAGATCTACCTCTGGAAGCTGGTAAAAGATGAGGAAACTGAGGCTCAGAGAG 
GT GAAGTACC T GGCCCAAGGCCACACAGCCAGAAT CTTCCAC T TGAC TCAGATCAAGAAAGTCAG 
GAAGCAAGAC TTCCAGAAAGAGGCACAGCAC T T CCGACTGCTCGC TGGCCCCCACGAAGGT CACT 
GGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGACAAAGTGACCAAG 
ACATAACAAAGAC C TAA CAGT T GCAGATATGAGCT GTATAAT TGT TGTTAT TAT AT AT TAATAAA 
TAAGAAGTTGCATTACCCTCAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 



MVPEVRVLSSLLGLAiLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 8s 



m 
ill 

si. 

S r 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTT T C CAT GCC GT GAGGTCC ATT CACAGAACACAT CC ATGG CT CTCATGCT CAGT TT GGTT CT GAGT C 
TCCTCAAGCT GGGATCAGGGCAGTGGCAGGT GTT TGGGCCAGACAAGCCT GT CCAGGCCT T GGT GGGGGAG 
GACGCAGCATTGTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CCAGTTCTCTAGCGTGGTCCACCTCTACAGGGACGGGAAGGACCAGCCATTTATGCAGATGCCACAGTATC 
AAGGC AGGACAAAACT GGTGAAGGAT TCTAT T GCGGAGGGGCGCAT CTCT CT GAGGCT GG AAAACATT ACT 
GT GT TGGATGCTGGCCT CTATGGGTGCAGGAT T AGTTCCC AGTCTT ACTACC AGAAGG CCATCT GGGAGCT 
AC AGGT GT C AGCACTGGGCTCAGT TCCT CT CAT T TCCATC ACGGGAT ATGTT GAT AGAGACAT CCAGCT AC 
TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
ACAGACTCCAGGACAAAC AGAGACAT GC AT GGCCTGTT TGATGTGGAGAT CTCTCT GACCGT CC AAGAGAA 
CGCC GGGAGC ATAT CCT GTT CC ATGCGGC AT GCT C ATCTGAGCCGAGAGGTGGAAT CC AGGGT ACAGAT AG 
GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
TT T T TTGGCATT GT TGGACT GAAG AT TTTCT T CT CCAAAT TCCAGT GGAAAATCCAGGCGGAACT GGACT G 
GAGAAGAAAGCACGGACAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAG 



PI AGAC GGCT CACCCGAAGCT CTGCGTT TCTGAT CT GAAAACTGTAACCC AT AGAAAAGCTCCC CAGGAGGT G 

Ill CCTC ACTCTGAGAAGAGAT T T AC AAGGAAGAGTGTGGT GGCTTCTC AGAGTT TCC AAGCAGGGAAACAT T A 

CI CT GGGAGGTGGACGGAGGAC AC AATAAAAGGT GGCGCGTGGG AGTGTGCCGGGAT GAT GTGGAC AGGAGGA 

111 AGGAGT ACGT GACT TT GTCT CC CG AT CATGGGT ACTGGGTCCTC AGACTGAATGGAGAAC AT TT GT AT TTC 

W ACAT TAAATCCCCGTT TT AT CAGC GT CT TCCCCAGGACCCCACCTACAAAAAT AGGGGT CTT CCTGGACT A 

m 

TGAGT GTGGGAC CATCTC CT T CTT CAAC AT AAAT GACCAGTCC CTT ATT T AT ACCCTGAC ATGT CGGTTT G 
AAGGCTTATT GAGGCCCT AC AT T GAGTATCCGTCCT ATAATGAG CAAAATGGAACT CCCATAGT CATCTGC 
CCAGTCACCCAGGAATCAGAGAAAGAGGCCTCTTGGCAAAGGGCCTCTGCAATCCCAGAGACAAGCAACAG 
TGAGTCCTCCTCACAGGCAACCACGCCCTTCCTCCCCAGGGGTGAAATGTAGGATGAATCACATCCCACAT 
TCTT CTTT AGGGAT AT T AAGGT CTCT CT CCCAGATCCAAAGTC CCGCAGC AGCCGGCC AAGGTGGC TTCC A 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CTGACATTACATTTAGTTTGCTCTCACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTCAGGTGAAG 
AACC GTCAGG AATT CCCATCT C AC AG GC TGTGGT GT AGATT AAGTAGACAAGGAAT GT GAAT AATGCT T AG 
AT CT TATTGAT G AC AGAGTGT AT CCT AATGGT TT GTTCATT AT ATT AC ACTT TCAGTAAAAAAA 
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FIGURE 84 



MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSWH 
LYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGS VPL I S I TGYVDRDI QLLCQS S GWFPRPTAKWKGPQGQDLS TDSRTNRDMHGLFDVE I SL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 8 s 



AACAGACGTTCCCTGGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCCCCT 
GC TC TGGGGGAGGGAGAGGGCGGAAGGACAGACAAGT AAAC TGCT GACGATGCAGAGTT CCGT GA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAAT T GCACCC TGAGCAT CAGAGAT GCCAGAAGAAG TGAT GCGGGGAGA 
TAC TTC TT TCGTAT GGAGAAAGGAAGTATAAAAT GGAAT TATAAACAT CACCGGCT C TC T GTGAA 
TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 



cr 



ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 



GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 
GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 
CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 
AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 
ATGAGAAAC TGCAGAGAC TCACCC TGAT T GAGGGATCACAGCCCCTCCAGGCAAGGGAGAAGT CA 
GAGGCTGAT T C T TGTAGAAT TAACAGCCC TCAACGTGAT GAGC TAT GATAACACTAT GAAT T ATG 
TGCAGAGTGAAAAGCACACAGGC T T TAGAGTCAAAGT AT CT CAAACCTGAATCCACACT GT GC CC 
TCCCTTTTATTTTTTTAACTAAAAGACAGACAAATTCCTA 



APPJD= 
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FIGURE 86 

MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLSIRDARRSDAGRYFFRMEKGSIKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLSFCVIFVWRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 

AGAAAGC TGCACT C T GT TGAGC TCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGC T C T C TGTAC 
CCAAGGAAAG T GCAGCT GAGAC TCAGACAAGAT TACA ATGA ACCAAC TCAGCT T CCTGCTGT TTC 
TCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATAC TTAC TTCAAGGAATGGACCTGT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
CAAC TACAACACC T T TGGAT C TGCAGAGGCGGCCACGAGCGATGACTACAAGAACCC TGGC TACT 
ACGACATCCAGGGCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 
GT T TGGCATC TACCAGAAATATCCAGTGAAATAT GGAGAAGGAAAGTGT TGGACTGACAACGGCC 
CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
GGCCAGCGGGAAT T CACT GC GGGAT TTGTTCAGT TCAGGGTAT TTAATAACGAGAGAGCAGCCAA 
CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 
GATAC T T TCCAGAGGCCAGT CCCCAGCAGTGT GGAGAT T T T TC TGGT T T TGAT TGGAGT GGATAT 
GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
TTGAGAGTTTTGTGGGAGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAA 
CAAC T TACCCAGT AGCTAGAATGTTAATGGCAGAAGAGAAAACAATAAATCATATT GAC TCAAGA 
AAAAAA 
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FIGURE 88 



If! 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWTDNGPVIPVWDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERiW^MjCAGMRVTGCN 
TEHHC I GGGGY FPEAS PQQCGDFS GFDWS G YGTHVG YS SSREI TEAAVLLFYR 



Important features: 

late 

Signal peptide: 

p| amino acids 1-16 

I 

2 L ; 

N-glycosylation site. 

'% f 

amino acids 163-167 



Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 



CI! 



rii 



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTACCTTCA 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
TTAAGACTTATATACAGTTTTAGGGGACAATTAAAAAAAAAAAA 
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FIGURE qo 

MGRVSGLVPSRFLTLLAHLVWITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNSTQSL I S IGAHCSAS VALS FFI FERWECTTYWYI FVFCSALPAVTEMALFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE Qi 



CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGATCCAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
CCAACACAATGCCACAGGCCAGGGAAACATCT CCAT CAGCCTCGTGCCCCCCAGTAAAGCTGTAG 
O AGT TCCACCAGGAACAGCAGATCT T CATCGAAGCCAAGGCCT C CAAAAT CTTCAACTGCCGGATG 

111 GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
H ]f TGTACAT CGCCTTC TACAGCACGGACTATCGGCT GGTCCAGAAGGTG TGCCCAGAT TACAACTAC 

f|| CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
g TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
C3 AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 
JJ] AGAAAGGGT CCCAAGT GCTGGTCCCAACC TGAAGC T GT GGAGT GAC TAGATCACAGGAGCACT GG 

|| AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 
m TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
HI CCT TGGT TCT TGCCAT CCTGAGGAAAGAT AGCAACAGGGAGGGGGAGAT T TCATCAGTGT GGACA 

GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCAT CCTT AAGC TAAGACAGGACGAT TGT GGT CCT CCCACAC TAAGGCCACAGCCCATC 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCRMEWEKVERGRRTSLCTHDP 
AK I CSRDHAQS S ATWS CS QPFKWCVY I AFYS TDYRLVQKVCPDYNYHS DTP YYPS G 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Q3 



pais 



IP 



5*! 
W ii 



1,^ 



CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
ATGTTCCGATTTGCATATTATAAACTCTTAAAAAAAGCCAGTGAAGGTTTGAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 
AACCT CAGGGAACCAGCACT T CCCAAACCGCAGAC T ACATCT T TAGAGGAAGCACAACT G TGCCT 
T T T T C T GAAAAT CCCTTTTTC T GGT GGAAT T G AG AAAG AAAT AAAAC TAT G C AG AT A 
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FIGURE QJ. 



MTAAVFFGCAFIAFGPALALYVFT IAIEPLRI IFLIAGAFFWLVSLLISSLVWFMARVI IDNKDG 
PTQKYLLIFGAFVSVYIQSMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium: neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE 

AAT T T T TCACCAGAGTAAACT TGAGAAACCAACT GGAC CT T GAGTATTGTACATT T TGCCTCGTG 
GACCCAAAGGTAGCAATCTGAAACATGAGGAGTACGATTCTACTGTTTTGTCTTCTAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATGAGG 
GAACACTACCAAACCAACAGCAGT CAAATCAGGT CT TTCCTT CTTTAAG TCTGATACCAT TAACA 
CAGATGC TCACACT GGGGCCAGAT C TGCATC T GT TAAATCC T GC TGCAGGAATGACACCT GGTAC 
CCAGACCCACCCAT TGACCCT GGGAGGGT TGAAT GT ACAACAGCAACTGCACCCACATGT GT TAC 
CAAT T T T TG T CACACAAC T T GGAG CCCAG GGC AC TAT C C T AAGC T C AGAGGAAT T GC CAC AAAT C 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
GCAGGCATCCAAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATCAGCAAAT GGAAT TCA 
GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
AT TGAT TATAT TAT GGAAT AGAT TGAGACACAT T GGATAGTCT TAGAAGAAAT TAAT TCT T AAT T 
TACCTGAAAATATTCTTGAAATTTCAGAAAATATGTTCTATGTAGAGAATCCCAACTTTTAAAAA 
CAATAAT TCAATGGATAAAT CTGTC TT TGAAATAT AACATTATGCTGCC TGGATGATAT GCATAT 
TAAAACATATTTGGAAAACTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE Q6 

MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMT PGTQTHPLTLGGLNVQQQLHPHVLP I FVTQLGAQGT I LS SEE 
LPQIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSG 
TDDDFAVTT PAG I QRS THAI EEAT TESANGI Q 

Signal peptide: 

amino acids 1-16 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 
AGCCTGTGC T TGT GCAC GT GAAGGATGAGAAT GACCAGGTGCCCCAT T T C TC T CAAGCCAT CTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGATGAGCCAGGCACAGCCAACTCGGAT C T TCGAT TCCACAT CCT GAGCCAGGCT CCAGCCCAGC 
H CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
'P ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA 
f*i CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
CCCTAGAGCC TAT CCACC T GGCAGAGAATC TCAAAG TCCTAT ACCCGCACCACATGGCCCAGGTA 
f ! CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
W T GCAGAGGGAAACCTCT ACGT GACCAGAGAGC TGGACAGAGAAGCCCAGGCT GAGT ACC T GCT CC 

SI AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
y ; ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
0t% » CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 

U * ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
• CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
C| CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
If! TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCT GAGT TCATCACTTCCCAGATT GGGCCT 

J*% ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 
Wl GCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACCTCAGTTATGAG 
O GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
HI . AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 

ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
ATTTTCACCCACTGGACCATGTCAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGTGCCCCT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCCTGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCA 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TATGGAC TGCC CAT GGGAGTGC T CCAAATGT CAGGGT GT TTGCCCAATAATAAAGCC CCAGAGAA 
CTGGGGTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE Q8 



MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
STSLDHALERTYQLLVQVKDMGDQASGHQATATVEVS I IESTWVSLEPIHjLAENLKVLYPHHMAQ 
VHWSGGDYHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVIAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 

I* 

;. PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 

I 

| EAAPSHEVVVWQSVAKLVGPGPGPGATATVTVLVERYMPPPKLDQES YEASVPI SAPAGS FLLT 

1 IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
I QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
f REHI I PVWSHNAQMWQLLVRVIVCRCIWEGQCMRKYGRMKGMPTKLSAVGILVGTLVAIGI FLI 

is 

_ I LI FTHWTMSRKKDPDQPADS YPLKAT V 

5 Signal peptide: 

fa - 

M amino acids 1-18 

as. 

Transmembrane domain: 

BiP" 

y ■ amino acids 762-784 
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FIGURE QQ 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGTCCAAGAT T C T TCCCAGGAACACAAACGTAGGAGACCCAC GC TCC TGGAAGCACCAGCCT T TA 
TCTCTTCAGCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATCTAGGACCCAGGCAT CT T GC T T TCCAGCCACAAAGAGACAGATGAAGATGCAGAAAGGAAATG 
T TCTCCTTATGT TTGGTCTACTAT TGCATT TAGAAGCTGCAACAAATTCCAATGAGACTAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGTCACCAACT C T GAGT TCCATACAACCT CCAGTGGGATCAGCACAGCCACCAACT CT GAG 
T T CAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTC TGAGT CCAGCACAACCT CCAGTGG 
GGCCAGCACAGCCACCAAC T CTGAGTCCAGCACACCCT CCAGTGGGGC CAGCACAGT CACCAACT 
M CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
O AGTAGGGCCAGCACTGCCACCAACTCTGAGTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCAC 
fi% CAACTCTGAC TCCAGCACAACCT CCAGTGGGGCTAGCACAGCCACCAAC TCT GAGT CCAGCACAA 

S CC TCCAGTGGGGCCAGCACAGCCACCAACTC TGAGT CCAGCACAGTGTCCAGTAGGGCCAGCACT 

f l GCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAG 
W AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
H GCACAGCCACCAAC TCT GACT CCAGCACAGTGTCCAGT GGGGCCAGCACT GCCACCAACT CTGAG 

§»*!,. TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
f|l GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT 
V 5 CTGAGTCCAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACACCCTCC 
^ AGTGGGGCCAACACAGCCACCAACT CT GAGTCCAGTACGACCTCCAGT GGGGCCAACACAGCCAC 

C| CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 
|jff CCTCCAGTGGGGT CAGCACAGCCACCAAC TCT GAGT CCAGCACAACC TCCAGTGGGGC TAGCACA 

GCCACCAACTC T GACT CCAGCACAACC TCCAGTGAGGCCAGCACAGCCACCAACTCT GAGT CTAG 
Jjf CACAGTGTCCAGT GGGAT CAGCACAGTCACCAAT TC TGAGT CCAGCACAACC TCCAGTGGGGCCA 

Wl ACACAGC CAC CAAC TCTGGGTC C AG TG T GACC T C T G CAG GC TCT GGAACAGCAGCT C T GAC TGGA 

© ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
PI GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TCTTCTTCTGTGT GAGAAACAGCC T GTCCCTGAGAAACACC T T TAACACAGC TGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GTGGAGTCCTAACT GGT TCTGGAGGAGACCAGTATCAT CGATAGCCATGGAGATGAGCGGGAGGA 
ACAGC GG GC C C TGAGCAGC C C C G GAAGCAAG T GC C G CAT T C T T C AGGAAGGAAGAGAC C T G GGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AATCTTGAAGAAGGTAT T CC TCACCT T TC T TGCCT T T ACCAGACAC T GGAAAGAGAATACT ATAT 
TGC TCAT T TAGCTAAGAAAT AAATACATC TCAT CTAACACACACGACAAAGAGAAGC TGT GCT T G 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CAT CTGGCAT TCAAAAT C TCCACAGTAAAATCCAAAGAC CT CAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 100 



MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNS ES S T T S S GAS TATNSE S S TVS SRAS TATNSE S S T T S SGAS TATNS E SRT T SNGAGTATNS ES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
AVGLFAGL FFC VRNS LS LRNT FNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWS PNW FWRRPVS S I 
AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 
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FIGURE 1Q1 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGC TGGAAC TAT CAGGC T GAAAAACAGAGTGGGT ACTCTC TT C TGGGAAGCT GGCAACAAAT 
GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AAC GAGGC AG T T T C T AGC T AC TGCACG T AC T T CAT AAAGCAGGAC T C T AAAAGC T T T GGAAT CAT 
GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
GAT T ATAAC T GGGGAT GCAT T TGT T CCTGGAGAAAGAAGT GT CAT TATCATGAACCATCGGACAA 
IP GAATGGACT GGAT GT T CCTGT GGAATT GCC TGAT GCGATATAGCTACC TCAGAT TGGAGAAAAT T 

5*1 TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
y CTTCAT T CATAGGAAATGGAAGGATGACAAGAGCCAT TT CGAAGACAT GAT TGATT ACT T T T GTG 

m ATATT CACGAACCACT T CAACTCC T CATATTCCCAGAAGGGAC TGATCTCACAGAAAACAGCAAG 

i TCTCGAAGTAAT GCAT T T GCT GAAAAAAATGGAC T TCAGAAATAT GAATATGT T T TACATCCAAG 

CI AACTACAGGC T T TACT T T TGT GGTAGACCGTCTAAGAGAAGGTAAGAACCT TGATGC TGTCCATG 

ATATCACTGT GGCGTAT C CTCACAACAT T CC TCAAT CAGAGAAGCACC T CCTCCAAGGAGACT T T 
CCCAGGGAAATCCACT TT CACGTCCACC GGTAT CCAATAGACACCC TCCCCACATCCAAGGAGGA 



HI 



m CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 



ry 



GGGAGAAGAAT T TT TAT T TTACCGGACAGAGT GT CATT CCACC T TGCAAGTC TGAACTCAGGGTC 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATTTGTACAGT CT T GT TAAGT GGTAT T TTATAAT CACCAT TGTAATCT TTGT GC TGCAAGAGA 
GAATAT T TGGT GGAC T GGAGATCATAGAACT T GCAT GT TACCGAC T T T TACACAAACAGCCACAT 
T TAAAT TCAAAGAAAAATGAG TAA GAT TATAAGGT T TGCCAT GTGAAAACCTAGAGCATAT TT TG 
GAAATG T TC TAAACCTT TCTAAGC T CAGATGCAT T T T T GCAT GAC TATGTCGAATAT T T CT TACT 
GCCATCATTAT TT GT TAAAGATAT T T T GCAC T TAAT T T T GTGGGAAAAATAT TGCTACAAT TT TT 
TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 
GGGC C AGAATAT TAT TAAAC AAT CAT CAGGC T T T T AAA 
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FIGURE 102 



MHSRGREIWLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLPLMFVNPSWYRWII^RLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDWM 
FLWNCLMRYS YLRLEKI CLKASLKGVPGFGWAMQAAAYI FIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVIPPCKSELRVLVVKLLSILYWTLFSPAMCLLIYLYSLVKWYFIITIVIFVLQERIFGG 
LE I IELAC YRLLHKQPHLNSKKNE 

Important features of" the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 . 
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FIGURE 103 



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAATCATCCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAACCCACAGGAATA 
TCC ATGG CTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGC7UVGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACCT C TACAGAGATGGGGAAGAC TGGGAATCT AAGCAGAT GCCACAGTATCGAGGGAGAACTGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
M s AGGAT T TGTC T T CAGACT CCAGAGCAAATGCAGAT GGGTACAGCC TGTAT GAT GT GGAGATCT CC 

Q AT TATAGTCCAGGAAAATGCT GGGAGCATATT GT GT TC CAT CCACC TTGC TGAGCAGAGT CAT GA 

PI GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
III TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
AAATCCAAAGGGAAAATCCAGGCGGAACTGGACTGGAGAAGAAAGCACGGACAGGCAGAATTGAG 
Ul AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
TTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGA 
TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGA 
CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
ATGTGAC T T T GTCT CCCAACAATGGGTATT GGGTCCT CAGACTGACAACAGAACAT T TGTAT T TC 
». ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
Q GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
fp, T GACATG TCAGT T T GAAGGC T TGT T GAGACCC T ATATCCAGCATGCGATGTATGACGAGGAAAAG 

%1 GGGAC T CCCATAT T CATAT GTCCAGTGTCC TGGGGATGAGACAGAGAAGACCCTGCT TAAAGGGC 

CCCACACCACAGACCCAGACACAGCCAAGGGAGAGT GC TCCCGACAGGTGGCCCCAGCT TCCT CT 
III CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
Gl TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
f|| TCAGAAGCCATGGCTGCCCTGAAGTGGGGACGGAATAGACT CACATTAGGTT TAGT TTGTGAAAA 

CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 
ACAGTGATTCCTGCCTCACAGGTGAAGATTAAAGAGACAACGAAT GTGAATCAT GCT TGCAGGT T 
TGAGGGCACAGTGTTTGCTAATGATGTGTTTTTATATTATACATTTTCCCACCATAAACTCTGTT 
T GCTTAT TCCACAT TAAT TTAC T T T TCTCTAT ACCAAAT CACCCAT GGAATAGT TAT T GAACACC 
TGC TTTGTGAGGCTCAAAGAATAAAGAGGAGGTAGGATT TT TCACTGATTCTATAAGCCCAGCAT 
TACCTGATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATCC 
C TCAT TAACACAGACACAAAAATTC TAAATAAAAT T T TAACAAAT TAAACTAAACAATATATT TA 
AAGATGATATATAACTAC TCAGTG TGGT T TGT CCCACAAAT GCAGAGT TGGTT TAATAT T TAAAT 
ATCAACCAGTGTAATT CAGCACAT TAATAAAGTAAAAAAGAAAACCATAAAAAAAAAAAAAAA 
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MAFVLILVLSFYELVSGQWQVTGPGKFVQA^^ 

LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGS LPL I S I VGYVDGG I QLLCLS S GWFPQPTAKWKGPQGQDL S S DS RANADG YS L YDVE I S I 
IVQENAGSILCSIHLAEQSHEVESKVLIGETFFQPSPWRLASILLGLLCGALCGVVMGMIIVFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQGFQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTLSPNNGYWVLRLTTEHLYFT 
FNPHFI SLPPSTPPTRVGVFLDYEGGT I S FFNTNDQSLI YTLLTCQFEGLLRPY I QHAMYDEEKG 
TPIFICPVSWG 

jf|| Signal peptide: 

|sj amino acids 1-17 

Transmembrane domains: 

as | 

amino acids 131-150, 235-259 
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CCTTCACAGGACTCTTCATTGCTGGTTGGCAATGATGTATCGGCCAGATGTGGTGAGGGCTAGGMAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CTCACT GT TCAT T ATGT GAGAT AT AAT C AAAAGAAG AC CT AC AATT ACT AT AGC ACAT TGTC AT TT ACAAC 
TG AC AAACTATATGCT GAGT TT GGCAGAGAGGCTTCT AAC AATT TT AC AGAAAT GAGCCAGAGACT TG AAT 
CAATGGTGAAAAATGCATTTTATAAATCTCCATTAAGGGAAGAATTTGTCAAGTCTCAGGTTATCAAGTTC 
AGTC AACAGAAGCATGGAGT GT TGGCTC AT AT GCTGTT GATT TGT AGATTTC ACT CTACT GAGGAT CCT GA 
AACT GT AGAT AAAATT GTTC AACTTGTT TT AC AT GAAAAGCT GC AAGAT GCT GT AGGACCCCCT AAAGT AG 
ATCCTCACTCAGTTAAAATTAAAAAAATCAACAAGACAGAAACAGACAGCTATCTAAACCATTGCTGCGGA 
H ACAC GAAGAAGT AAAACTCT AGGTC AGAGTCT C AGGAT CGTT GGT G GGAC AGAAGT AGAAGAGGGTGAAT G 

O GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
IP TTGT GAGT GCTGCT C ACTGT TT T ACAAC AT AT AAGAAC CCTGCC AG AT GGAC TGCTTCCT TTGG AGTAAC A 

U.i AT AAAACCTT CGAAAATGAAACGGGGTC TCCGGAGAAT AATT GTCC ATGAAAAATACAAACACCCATCAC A 

u TGAC TATGATATTT CT CT TGCAGAGCTT TCT AGCCCT GTT CCCT AC AC AAATGC AGT AC ATAGAGT TTGT C 

TCCCTGATGCATCCTATGAGTTTCAACCAGGTGATGTGATGTTTGTGACAGGATTTGGAGCACTGAAAAAT 
GATGGTTACAGTCAAAATCATCTTCGACAAGCACAGGTGACTCTCATAGACGCTACAACTTGCAATGAACC 
*»$ TC AAGCT T AC AATGACGC CAT AAC TCCT AGAATGTT AT GT GCTGGCTCCT T AGAAGGAAAAACAGATGCAT 

? || GCCAGGGTGACTCT GG AGGACC ACTGGT T AGT TCAGAT GCT AGAGAT AT C TGGT ACCT TGCT GGAATAGT G 

f I AGCT GGGGAGATGAAT GT GCGAAACCCAAC AAGCCTGGTGT TT ATACT AGAGTT ACGGCCTT GC GGGACT G 

jjpQ GATTACTTCAAAAACTGGTATCTAAGAGACAAAAGCCTCATGGAACAGATAACATTTTTTTTTGTTTTTTG 
%4 GGTGTGGAGGCCATTT T T AGAGAT AC AGAATT GGAGAAGACTTGCAAAAC AGCT AGAT TT GACT GATCTC A 



ru 



ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GATCAACTCTGTCATCTGTGAGCAATAGTTGAAACTTTATGTACATAGAGAAATAGATAATACAATATTAC 
AT T ACAGCCT GT AT TC ATT T GTTCTCT AGAAGTT TTGT CAGAAT TT T GACTT GT TGAC AT AAAT TT GT AAT 
GCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTCAGCTCCTCTCATTTCAGCAAATATCCATTT 
TC AAGGT GC AGAACAAGGAGT GAAAG AAAAT ATAAGAAGAAAAAAATCCCCT AC ATTTTATT GGCAC AGAA 
AAGT AT TAGGT GTT TTTCTTAGTGGAAT ATT AGAAAT GAT CAT ATT C ATT ATGAAAGGTC AAGC AAAGACA 
GCAGAATACCAATCACTTCATCATTTAGGAAGTATGGGAACTAAGTTAAGGAAGTCCAGAAAGAAGCCAAG 
AT AT AT CCT TAT T T TCAT TTCCAAACAACTACT ATGAT AAAT GT GAAGAAGATTCT GT TT T T TT GT GACCT 
AT AAT AATT AT ACAAACT TC AT GCAATGTACT TGTTCT AAGC AAAT T AAAGC AAAT AT TT AT TT AACAT TG 
TT ACTGAGGATGT CAACATAT AAC AAT AAAAT AT AAAT CACCCA 
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MMYRPDVVRARKRVCWEPWIGLVIFISLIVIAVCIGLTVHYVRYNQKKTYNYYSTLSFTTDKLY 
AEFGREASNNFTEMSQRLESMVTOtfAFYKSPLREEFVKSQVIKFSQQ 

PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFGVTIKPSKMKRGL 
RR 1 1 VHEKYKHP S HDYD I S LAELS S PVP YTNAVHRVCLPDAS YE FQPGDVMFVTGFGALKNDG YS 
QNHLRQAQVTL I DATT CNEPQAYNDAI T PRMLCAGSLE GKT DACQGDS GGPLVS S DARD I WYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAATGTTCTCACTTTAACTGCAGAAACGAGGGTAGAGGAAGCAGTCATTTTGACTTACTTTCCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GATATGGTCACT T T GAAAGCCAGGATGACAAATTAT GGAT TACCTAGATATCGGT GGC T TAC TCA 
jU TGC TTGGAAT TT T T T TCAGAGAGAGT T TAAGT GCT GT GGAGTAGTATAT T TCACT GACTGGT T GG 

fl AAATGACAGAGAT GGACTGGCCCCCAGAT TCCTGC TGT GTT AGAGAAT TCCCAGGAT GT T CCAAA 

X CAG GC C CAC C AG GAAGAT C T CAGT GAC C T T TAT CAAGAGGGT T GT GGGAAGAAAAT G TAT T C C T T 

Jjf TTTGAGAGGAACCAAACAACTGCAGGTGCTGAGGTTTCTGGGAATCTCCATTGGGGTGACACAAA 
V 1 TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
flf AC AGAC C AAAT GAT G T C C T T G AAG AAT GAC AAC T C T CAG CAC C T G T CAT GT C C C T C AG T AG AAC T 
\f GT TGAAACCAAGCCTGTCAAGAATC TT TGAACACACAT CCAT GGCAAACAGC T T T AATACACACT 

ll T TGAGAT GGAGGAGTTATAAAAAGAAAT GTCACAGAAGAAAACCACAAAC TTGT T T TAT T GGACT 

^ TGTGAATTTTTGAGTACATACTATGTGTTTCAGAAATATGTAGAAATAAAAATGTTGCCATAAAA 
& l TAACACCTAAGCATATACTAT TCTATGC TTTAAAATGAGGATGGAAAAGT TTCATGTCATAAGTC 

* ACCACC TGGACAATAAT T GAT GCCCT TAAAATGC TGAAGACAGATGTCATACCCAC T GT GTAGCC 

p . TGT GTAT GAC T T T TACT GAACACAGT TATGT T TT GAGGCAGCATGGTT TGAT TAGCATTT CCGCA 

in TCCAT GCAAACGAGTCACATATGGTGGGACT GGAGCCATAGTAAAGGT TGAT T TAC T TC TACCAA 

%l CTAGTATATAAAGTAC TAAT TAAATGC TAACATAGGAAGT TAGAAAATAC TAATAAC T T T TAT TA 

U CTCAGCGAT C TAT TCT T CTGATGC TAAATAAAT TATATATCAGAAAAC T T TCAATAT TGGT GACT 

|!1 ACC TAAATGTGAT T T T TGCT GGTTACTAAAATAT TCT TACCAC TTAAAAGAGCAAGC TAACACAT 

Q TGTC T TAAGC TGAT CAGGGATTT T T TGTATATAAGT CT GT GT TAAA.TCT GTATAAT T CAGT CGAT 

ps TTCAGTTCTGATAATGTTAAGAATAACCATTATGAAAAGGAAAATTTGTCCTGTATAGCATCATT 
ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 
TGT TATT TAAATACTTAACCACTAATTT TGAAAATTACCAGTGTGATACATAGGAAT CATTAT TC 
AGAATGTAGTCTGGTCTT TAGGAAGTATTAATAAGAAAATTTGCACATAACTTAGT T GATTCAGA 
AAGGACTTGTATGCTGTTTTTCTCCCAAATGAAGACTCTTTTTGACACTAAACACTTTTTAAAAA 
GC T TATC T T TGCCT TCT CCAAACAAGAAGCAATAGT CT CCAAGTCAATATAAAT TCTACAGAAAA 
TAGT GT TCT T T T T CTCCAGAAAAATGC T TGTGAGAAT CAT T AAAACATGTGACAAT T TAGAGAT T 
CT T T GT T T TAT TT CACT GATT AATATACTGTGGCAAATTACACAGAT TAT TAAAT T T T T T TACAA 
GAGTATAGTATAT T TAT T TGAAAT GGGAAAAGTGCAT T T TACTGTAT T T T GTG T AT T T TGT T TAT 
TTC TCAGAATATGGAAAGAAAATTAAAATGT GT CAAT AAATAT TT TC TAGAGAGTAA 
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MAREDSWCLRCLLYALNLLFWMSISVLAVSAWMRDYLNNVLTLTAETRVEEAVILTYFPVVHP 
VM IAVCCFL 1 1 VGMLGYCGT VKRNLLLLAWYFGS LLVI FCVELACGWTYEQELMVPVQWSDMVT 
LKARMTNYGL PRYRWLTHAWNFFQRE FKCCGWY FTDWLEMTEMDWPPDSCCVRE FPGCSKQAHQ 
EDLSDL YQEGCGKKMYS FLRGTKQLQVLRFLG I S I GVT Q I LAMI LT I TLLWAL YYDRRE PGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSD1ANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 

GAT AAAGT CT TT CT C GT GAGAGC C T AGAGGC C T T AAAAAAAAAAGT GCT T GAAAGAGAAGGGGAC AAAGGAAC A 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
C CT GGAGGAAC AC T G ACCAC C AGT T GG AT GAGT CT C AAG GTCC T CC T CT AT GT GAC AACC AT GT GAATGGGGAG 
TGGTACCACTTCACGGGCATGGCGGGAGATGCCATGCCTACCTTCTGCATACCAGAAAACCACTGTGGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
T C AAT GGGAACT GCT GT CT CT GGAAC ACCAC GGT G GAAGT CAAGGCT T GCCCT GGAGGCT AC T AT GT GT ATC GT 
CT GACCAAGC CC AGC GT CTGC T T C C AC GT CT ACT GT GGT CAT T TT T AT GAC ATCT GCGAC GAGGACT GCC AT GG 
C AGCT GC T C AGAT AC CAGC GAGT GC AC AT GC GCT CC AGG AACT GT GCT AGGCCCT G AC AGGC AGAC AT GCTT T G 
AT GAAAAT GAAT GTGAGCAAAAC AACGGT GGC T GC AGT GAGAT CT GT GT GAACCT C AAAAACTC CT AC C GCT GT 
GAGTGTGGGGTT GGCCGTGTGCT AAGAAGT GATGGCAAGACTT GTGAAGACGTT GAAGGATGCCACAAT AAC AA 
%j TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 

I ;: L C T GAGGAT AAC C AC ACTT GCCAAGTCC CT GT GT T GT GC AAATC AAAT GC C AT T GAAGT GAACAT CC CC AGGGAG 

;; " ■ 

|!| CT GGT T GGTGGCCTG GAGC TCTT CCT GACC AACAC CT CC T GC C GAGGAGT GTCC AAC GGC AC CC AT GT C AAC AT 

« CCTCTTCTCTCT C AAGAC AT GT GGT AC AGT GGTC GAT GT GGT GAAT GAC AAGAT T GT GGC CAGC AACCT CGT GA 

O C AG GT CT ACCC AAGC AGAC CC C G GGGAGCAGC GGGGACT T CAT CATCC GAACCAGC AAGC T GCT GAT C C C GGTG 

Iff ACCTGCGAGTTTCCACGCCTGTACACCATTTCTGAAGGATACGTTCCCAACCTTCGAAACTCCCCACTGGAAAT 
CATGAGCCGAAATCATGGGATCTTCCCATTCACTCTGGAGATCTTCAAGGACAATGAGTTTGAAGAGCCTTACC 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
T T G GAAAGCT T GGT G GAGAGCT GCTT T GCCAC CCCCAC C TCCAAGATC GAC GAGGT C CT GAAAT ACT ACC T C AT 
CCGGGATGGCTGTGTTTCAGATGACTCGGTAAAGCAGTACACATCCCGGGATCACCTAGCAAAGCACTTCCAGG 
TCCCT GT C TT C AAGT T T GT GGGC AAAG ACC AC AAG GAAGT GTT T CT GCACT GCC GGGTT C T T GT CT GT GGAGT G 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGT CT AC AGGGC C AGAC GCT AAC AGGC GGCC C GATC C GC AT C GACT GGGAG GACTAGT T C GT AGC C AT AC CTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
T GGGTTC AGAC T T C ACACT GT GAGTT CAGACTCC CAGC ACC AACT C AC T C T GAT T C T GGT C CAT T C AGT GGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
CACAAAATCAGAAGCTGGGT AT AAT ATTTCAAGTTACAAACCCT AGAAAAATTAAACAGT T ACTGAAATT AT GA 
CT T AAAT ACC C AAT GACT C CT T AAAT AT GT AAAT TAT AGT TAT ACCTT GAAAT T TC AAT T C AAAT GCAGACT AA 
TT ATAGGGAAT TTGGAAGT GTAT CAAT AAAACAGTATAT AATT TT 
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MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDVVNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLYFGIEPWHV 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHiffiVFLHCRV 
LVCG VLDE RS RCAQGCHRRMRRGAGG E D S AGL QGQTLT GG P I RI DWED 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 

N-myristoylation sites. 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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GAGAGAGGCAGC AGCTT GCT CAGC GGAC AAGGAT GCTGGG CGTGAGGG ACC AAGGCCTGCCCTGCACTCGG 
GCCT CCT C CAGC CAGT GCTGAC C AGG GACT T C TG ACCTGC TGGCCAGCCAGGACCTGT GT GGGG AGGCCC T 
CCTGCTGCCTTGGGGTGACAATCTCAGCTCCAGGCTACAGGGAGACCGGGAGGATCACAGAGCCAGCATGT 
T ACAGGAT CCT G AC AGTG AT CAACCT C T GAAC AGCCT CGATGTCAAAC CCCT GCGC AAACCCCGTATCCC C 
ATGGAGAC CT TCAGAAAGGT GGGGAT CC CCAT CATCAT AGCACT ACTGAGCCTGGCGAGT ATCATC ATTGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GGAAGCAGCT GTGT GACGGAGAGCTGGACTGT CCCT TG GGGGAGGACGAGGAGC ACTGTGTCAAGAGCTTC 
CCCGAAGGGCCTGCAGTGGCAGT CCGCC TCT C CAAGGACCGATCCACACTGC AGGT GCTGGACT CGGCCAC 
AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
GCTACAGCAGAGCT GTGG AGAT TGGC CC AGACCAGGAT CT GGATGT TGTT GAAATCACAGAAAACAGCCAG 
GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
TC AGCATCCAGT ACGACAAAC AGCACGT CTGT GG AGGGAGCATCCT GG ACCCCCACTGGGTC CT CACGGC A 
GC CCACTGCT T CAGGAAAC AT ACCGATGTGTT CAACT G GAAGGT GC GGGC AGGCTCAGACAAACTGGGCAG 
CT T C CCAT CCCTGGCTGTGGCC AAGATC ATCATCAT T GAATT CAACCC CAT GT ACC CCAAAGACAATGAC A 
TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
GATG AGG AGCTCAC TCCAGCCACCCC AC TCTGGATCAT TGGATGGGGCTT T ACG AAGC AGAATG GAGGGAA 
GAT GTCT GACAT AC TGCTGCAGGCGT C AGTCC AGGTCATT GACAGC AC ACGGTGCAAT GCAGACGATGCGT 
ACCAGGGGGAAGTCACCGAGAAGATGATGTGTGCAGGCATCCCGGAAGGGGGTGTGGACACCTGCCAGGGT 
GAC AGT GGTGGGCCCCT G ATGT ACCAAT CTGACC AGTGGC ATGT GGT GGGCATCGT T AGCTGGGGCTATGG 
CTGCGGGGGCCCGAGCACCCCAGGAGTATACACCAAGGTCTCAGCCTATCTCAACTGGATCTACAATGTCT 
GG AAGGCTGAGCTGTAATGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGAT CCCCCAAAGT CAGACAC AGAGC AAGAGT CCCC TT GGGT AC ACCC CT CTGC CCAC AGCCTC AGCAT 
TTCTTGGAGCAGCAAAGGGCCTCAATTCCTGTAAGAGACCCTCGCAGCCCAGAGGCGCCCAGAGGAAGTCA 
GCAGCCCTAGCTCGGCCACACTTGGTGCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCT 
CAGGGGT ATT GCT AAGCCAAGAAGGAACTTTC CCAC ACT ACTGAAT GGAAGC AGGC TGTCT T GT AAAAGC C 
CAGATC AC TGTGGGCT GGAGAGGAGAAGGAAAGGGT CTGC GC CAGCCCTGTCCGT C TT CACCCAT CCCCAA 
GCCTACTAGAGCAAGAAACCAGTTGTAATATAAAATGCACTGCCCTACTGTTGGTATGACTACCGTTACCT 
ACTGTT GT C AT TGT TAT T AC AGCT AT GG CCACTATT AT TAAAGAGCTGTGT AAC AT CT CTGGCAAAAAAAA 
AAAA 
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MLQDPDS DQPLNS LDVKPLRKPRI PME T FRKVG IPIIIALLSLASII I VWL I KVI LDKY YFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKS FPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDWEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVSIQYDKQHVCGGSILDPHWLTAAHCFRKHTDVFNWKVRAGSDKL 
GSFPSLAVAKI 1 1 IEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWI IGWG 
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 
DQWHWG I VS WGYGCGGP ST PGVYTKVS AYLNW I YNVWKAE L 

Transmembrane domain: 

amino acids 32-53 (typell) 
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GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCG TGTCTGGCCTCT GAACAAC TT TT TCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTGTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACC TGACCT TCAC TC TGGAACGAGAACAGAGGTT T CTACCCACACCGT CCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
J;" CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
SE- AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
§| GAAGCAGTGGGT GAGACAT CACGC T GCCCGCCCAT CTAACCT T TTCATGTCC T GCACAT CACCTG 

|r| ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
% 4 CAGAAGGGGTCT GCTTAGACCACC TGGT T TAT GT GACAGGAC T TGCATTC TCCT GGAACATGAGG 

t'' GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAACT T GT GCCAAAT TAT GGGTCAGAAAAGATG 

Oil 

GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
fl CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
if! CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
Kl TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 
3l GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GT GCGAACACGGAAATGCCTCCAGTAAGCACAGGC T GCAAAAT CCCCAGGCAAAGGACTGT GT GG 
CTCAAT T TAAAT CATGT TCTAGTAAT TGGAGCTGT CCCCAAGACCAAAGGAGCTAGAGCT T GGT T 
CAAATGATCTCCAAGGGCCCTTATACCCCAGGAGACTTTGATTTGAATTTGAAACCCCAAATCCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGCTACTCGGGAGGCT GAGACAGGAGAAT TACT T GAACCT GGGAGGTGAAGGAGGCTGAGACA 
GGAGAATCACT TCAGC C T GAGCAACACAGCGAGACT CT GT CT CAGAAAAAATAAAAAAAGAAT TA 
TGGTTATTTGTAA 
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FIGURE 114 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 
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FIGURE lis 

CAGCAGTGGTCT CT CAGTC CT C TCAAAGCAAGGAAAGAGTACTGT GTGC TGAGAGAC CATG GCAA 
AGAATCCTCCAGAGAAT TGTGAAGACTGTCACAT TCTAAATGCAGAAGCTTT TAAATCCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTT.TTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
TC TACAGCAAT GGAGAGAAGAAGAAGAT TTACAT GGAAAT T GATCC TGT GACCAGAAC TGAAATA 
TTCAGAAGCGGAAATGGCACTGATGAAACATTGGAAGTGCACGACTTTAA?AACGGATACACTGG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
C T GAAC CAG AAG AG G AAAT AG AT G AGAAT GAAG AAAT T AC C AC AAC T T T C T T T G AAC AG T C AG T G . 
ATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGATTTTCTTAAAAATTCCAAAATTCTGGA 
GAT TTGTGATAACGTGACCAT GTAT TGGATCAATCCCACTCTAATATCAGT T TCT GAGT TACAAG 
ACT TTGAGGAGGAGGGAGAAGAT C T TCACT T T CC TGCCAACGAAAAAAAAGGGAT T GAACAAAAT 
GAACAGTGGGTGGTCCCTCAAGTGAAAGTAGAGAAGACCCGTCACGCCAGACAAGCAAGTGAGGA 
AGAAC T TCCAATAAAT GACTATAC TGAAAATGGAATAGAATT TGATCCCATGC TGGAT GAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTC TAA TAGGAGGTTTGAGCTCAAATGCTTAAAC 
TGC TGGCAACATATAAT AAAT GCATGC TAT TCAATGAAT T TC TGCC TATGAGGCAT CTGGCCCC T 
GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
TTATCACCAAAAAAAAAAAAAAAAAAA 
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FIGURE 116 



MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGIIiALTLIVLFWGSKHFWPEVPKPCAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFIiKNSKILEICDNVTMYWINPTLISVSE 
LQDFEEEGEDLHFPANEKKGIEQOTQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 

242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGC GGGC GGCCAGGAT CATGTCC ACCACCAC AT GCC AAGT GGT GGCGT T CC TCCTGT CC ATCCTGGGGCT 
GGCC GGCT GCATCGCGGCCACC GGGAT GGACATGTGGAGC AC CCAGGACC TGT ACGAC AACCCCGT CACCT 
CC GT GTTCCAGT ACGAAGGGCT CTGGAGGAGC TGCGT GAGGC AGAGTT CAGGCT TC ACCGAATGCAGGCCC 
T ATT TC AC CATCCTGGGACT TCCAGC CATGCT GC AGGC AGT GCG AGCC CT GATGAT CGT AGGC ATCGTCC T 
GGGT GCC ATTGGCCTCCTGGT ATCCATCTTT GCCCT GAAATGC ATCCGCAT T GGCAGC AT GGAGGACTCT G 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
s ; TCTGT GT T TGCC AACATGCT GGTGACT AACT TCT GG AT GT CC AC AGCTAACATGT ACACCGGCATGGGTGG 

£3 GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
r| TC AC ACT AAT TGGGGGTGTGATGATGTGCATC GCCTGC CGGGGCCT GGCACC AGAAGAAACC AACT AC AAA 

|l! GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
W TGGGTCCAAC ACCAAAAACAAGAAGAT AT ACGATGG AGGT GCCCGC AC AGAGGACGAGGT AC AAT C TT AT C 

% l CTTCCAAGCACGACT AT GTG TAAT GCTC TAAGACCTCT CAGCACGGGCGGAAGAAACT CCCGGAGAGCTCA 

CCCAAAAAACAAGGAGATCCCATCTAGATTTCTTCTTGCTTTTGACTCACAGCTGGAAGTTAGAAAAGCCT 
CGAT TTCATCTT T GGAGAGG CCAAAT GGTCTT AGCCT C AGT CTCTGTCTCT AAAT ATT CCACCAT AAAACA 
GC TG AGTTAT TT ATGAAT T AGAGGCT AT AGCTCACATTTT CAAT CCTCT ATT TCTT TT TTT AAAT AT AACT 
TT CT ACTCTG AT GAGAGAAT GTGGTTTT AAT C TCTCTCTC AC AT TTT GAT GATTT AGACAGACT CCCC CT C 
T T CC TC CT AGTCAAT AAACC CATTGATGATCT AT TTCCCAGCTTAT CCCC AAGAAAACTT TTGAAAGG AAA 
GAGT AGAC CC AAAGAT GTT AT T TT CT GCTGT T TGAAT T TT GTCT CCCC ACCCCC AACT TGGCTAGT AAT AA 
AC ACT T AC TGAAGAAGAAGC AATAAGAG AAAGAT ATT T GT AATC TCTCC AGCCC ATGATCTCGGT TTT CT T 
ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 
TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGGTC AGAAAT TGTC CCTAGATGAATGAGAAAATT ATTTTT TT TAAT TT AAGT CCT AAAT AT AGT T AA 
AAT AAAT AATGT T T T AGT AAAATG AT AC ACT ATC TCTGTGAAAT AGCCTCAC CCCT ACATGT GGAT AGAAG 
GAAATGAAAAAAT AAT TGCT TTGACATT GTCT AT ATGGTACTTT GT AAAGTC ATGC TT AAGT AC AAAT TCC 
AT GAAAAGCT C ACACC TGTAATCCTAGC ACT TTGGGAGGCT GAGGAGGAAGGATC ACTTGAGCC CAGAAGT 
TCGAGACT AGCCT GGGCAAC ATGG AGAAGCCCTGTCTC TAC AAAAT AC AGAGAGAAAAAATC AGCC AG TCA 
TGGT GGCAT ACACCT GT AGT CC CAGC AT TCCGGGAGGCTGAGGT GGGAGGAT C ACT TGAGCCCAGGGAGGT 
TGGGGCT GCAGT GAGCCATGAT C ACACC ACT GC ACTCCAGCC AGGT GACAT AGCGAGATCCT GT CT AAAAA 
AAT AAAAAAT AAAT AAT GGAAC AC AGCAAGTC CT AG GAAGT AGGTT AAAACT AATT CT TT AA 
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FIGURE 118 



MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFTILGLPAMLQAVRAIiMIVGIVLGAIGLLVSIFALKCIRIGSMEDSAKANMTLTSGIMFIVSGL 
CAI AGVS VFANMLVTNFWMS T ANMYTGMGGMVQTVQTRYT FGAAL FVGWVAGGLT L I GGVMMC I A 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE HQ 



GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGC T G GCT AAAAC T CAT C C CAGAGGAT AATGG CAACC CAT G C CT T AGAAAT CGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACAT CGTGGTTT TT GAAAACTTCT GGGAAGGACTGTGGAT GAATTGCGT GAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 
CAT CACGGG CAT GGTGGTGCT CAT C C CT G TGAGC T GGGT T GCCAATGC C AT CAT CAGAGAT TT CT ATA 
ACT CAATAGTGAATGT TGCCCAAAAACGTGAGCTTGGAGAAGCT CTCTACT TAGGATGGACCACGGCA 
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
CAGATACTCGATACCTTCCCATCGCACAACCCAA?U^AAGTTATCACACCGGAAAGAAGTCACCGAGCG 
T CTACTCCAGAAGTCAGTATGTG TAG TT GTGTATGTT TTTTTAACTT TACTATAAAGCCAT GCAAAT G 
ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
AAT CT TAAT TACAGGAACT GT GC AT CAGC T AT T TAT GAT T C TAT AAGC T AT T T CAGCAGAAT GAGATA 
TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 
CTCTT TTTATCAT TTACTT CAAAAT GACATT GCTAAAGACTGCAT TATT TTACTACT GTAATTT C TCC 
ACGACATAGCATT ATGTACATAGAT GAGTGTAACATT TATATCT CACAT AGAGACATGCTTATAT GGT 
TTTATTTAAAATGAAATGCCAGTCCATTACACTGAATAAATAGAACTCAACTATTGCTTTTCAGGGAA 
ATCATGGATAGGGTTGAAGAAGGTTACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCA 
TTTATAATGAAGATTAAAATGAAGGCTT TAATCAGCATT GTAAAGGAAATT GAAT GGCT TTCTGATAT 
GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
C T T GT GT AT TAAATTAACATT TT TAAAAG GCAGATAT TT TGT CAAGGGG CT T T GCAT TC AAAC T GCT T 
TTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGTG 
AAAATAT TTTTGT TTTTGTAT TT GAAG AAGAAT GAT GCATTTT GACAAGAAATCATATATGTATGGAT 
ATATTTTAATAAGTATTTGAGTACAGACTTTGAGGTTTCATCAATATAAATAAAAGAGCAGAAAAATA 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TATACAT TT ATAT TAATAAATTGTACATT TT TCTAATT 
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FIGURE 120 



MATHALEIAGLFLGGVGMVGTVAVTVMPQWRVSAFIENNIWFENFWEGLWMNCVRQANIRMQCK 
IYDSLLALSPDLQAARGLMCAASVMSF 

MWLI PVSWVANAI IRDFYNS I VNVAQKRELGEALYLGWT TALVL I VGGALFCCVFCCNEKSSSY 
RYSTPSHRTTQKSYHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 



Q Transmembrane domains: 

Ms amino acids 82-101, 118-145, 164-188 
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FIGURE 121 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GATTCAAAGGAGAAAAGGGGGAATGTC TGAGGGAAAGCTT TGAGGAGTCCTGGACACCCAACTAC 
AAGCAG TGT TCATGGAGTT CATT GAAT TATGGCATAGATC TT GGGAAAAT TGCGGAGT GTACAT T 
TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
AT TGAAGCTATAAT T TATT T GGACCAAGGAAGCCCT GAAATGAAT TCAACAAT TAATATT CAT CG 
CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
T T GGCAC TTGT T CAGAT TACCCAAAAGGAGATGCT TC TACT GGAT GGAAT TCAGT T TCTCGCAT C 
AT TAT T GAAGAACTACCAAA ATAAA TGC TT TAAT T TTCATT TGCTACC TC T TT TT TT AT TAT GCC 
T TGGAATGGT TCACT TAAAT GACAT T TTAAATAAGTT TATGTATACATC T GAATGAAAAGCAAAG 
tfl C TAAAT ATGT T TACAGACCAAAGTGTGATT T CACACT GT TT TTAAATC TAGCATTAT TCAT T T T G 

CT TCAAT CAAAAGTGGT TTCAATAT T T T TT TT AGT TGGT TAGAATACT T TC T T CATAGT CACAT T 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAATATAAAAGCTACCAATC T T TGTACAAT T T GTAAAT GT TAAGAAT TT T TT T TAT ATCT GT 
TAAATAAAAATTAT TTCCAACA 
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FIGURE 122 



MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
AT TGCCTGGAAGAATACAT CATGT T T T TCGAT AAGAAGAAAT TGTAGGATCCAGT T TTTTTTTTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCCTAAGCCAAAGCAAAAGAC C TAAGGACGAC C T T T GAACAATACAAAGGATGGGT T T CAATG 
TAAT TAGGC TACT GAGCGGAT CAGCTGTAGCACT GGT TATAGCCCCCAC TGTC TTACTGACAAT G 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
M ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
p ATAACAGCC T TCAAAAAC T TAAGTATAATCAAT T TAAAGGGC TCAACCAGC TCACCT GGC TATAC 

pi CT TGACCATAACCATATCAGCAATAT T GACGAAAATGCT T T TAAT GGAATACGCAGACTCAAAGA 

JjJ' GC TGAT TCT TAGTT CCAATAGAAT C TCCTAT T T TC TTAACAATACC T TCAGACCT GT GACAAAT T 

f J . TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
Ul CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
N| " AGACTGCCGCAACCTGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATG 

L ; = : T CT TTGC TGGCAT GATCAGACTCAAAGAAC TT CACCT GGAGCACAATCAATT TT CCAAGC T CAAC 

CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
CATAGGACAGACCAT GT CCT GGACC TGGAGC T CC T TACAAAGGCT T GAT T TATCAGGCAAT GAGA 
*. TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
TCCAACAAGC TCACAT T TAT T GGTCAAGAGAT TT TGGAT TC T TGGATATCCC TCAAT GACATCAG 
TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
T TAAAGGT C TAAGGGAGAAT ACAAT TATC T GT GCCAGT CCCAAAGAGCT GCAAGGAG TAAATGTG 
AT CGAT GCAG TGAAGAACTACAGCATC TGT GGCAAAAGT ACTACAGAGAGGT T T GATCT GGCCAG 
GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
TGCCCCC GACGGT GGGAGCCACAGAGCCCGGCCCAGAGACCGATGC TGACGCCGAGCACAT CT C T 
TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAGT CCC TAAAGCAAAT GACT CCCAGCACCCAGGAAT TT TATGTA 
GAT TATAAACCCACCAACACGGAGACCAGCGAGAT GCTGCT GAATGGGACGGGACCCTGCACC TA 
TAACAAATCGGGCTCCAGGGAGTGTGAGGTATGAACCATTGTGATAAAAAGAGCTCTTAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGT CAT TTTCCTCT CATACATAAT CAACCCATT GAAAT T TAAATACCACAAT CAAT GTGAAGC T T 
GAACTCCGGTTTAATATAATACCTATTGTATAAGACCCTTTACTGATTCCATTAATGTCGCATTT 
GTT T TAAGATAAAACT T CT T T CATAGGTAAAAAAAAAAA 
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FIGURE 124 



|?B is 



m 



III 



MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG 

LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLNNTFR 

PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 

LARNVFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQ 

SGNE IEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQE I LDSWI SLNDI SLAGNIWECSRNICSLVN 

WLKSFKGLRENTIICASPKELQGVOTIDAV^ 

S KPPLP PTVGATE PGPE TDADAEHI S FHKI I AGS VAL FLS VLVI LLVI YVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 



01 Important features of the protein: 

W Signal peptide: 

s T amino acids 1-33 



Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 4 65-4 68 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 



N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 125 

CCGTTATCGTCTTGCGCTACTGCTGAATGTCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACC T T T CCCCTGGAT CT CACAAAAAC T CGAC TCCAAAT GCAAGGAGAAGCAGCT C T TGC TC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
AT TGAAGAGGAAGGCT T TCTAAAGCT T TGGCAAGGAGTGACACCCGCCAT T TACAGACACGTAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATGCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
TTAGCCAATCCAACTGACCTAGTGAAGGTTCAGATGCAAATGGAAGGAAAAAGGAAACTGGAAGG 
AAAACCATTGCGATTTCGTGGTGTACATCATGCATTTGCAAAAATCTTAGCTGAAGGAGGAATAC 
GAGGGC T TTGGGCAGGC T GGGTACCCAATATACAAAGAGCAGCAC T GGTGAATAT GGGAGAT T TA 
ACCAC T TATGATACAGTGAAACACTAC TTGGTAT TGAATACACCAC T TGAGGACAATATCAT GAC 
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
T CAAAAGCAGAATAAT GAATCAACCACGAGATAAACAAGGAAGGGGACT T T T GT ATAAAT CATCG 
AC TGAC TGCT TGAT TCAGGCT GT TCAAGGTGAAGGAT T CAT GAGTC TATATAAAGGCTT T TTACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
TGAGTGGAGTCAGTCCATTTTAA 
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FIGURE 126 



MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHWYSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VIGGMMAGVIGQFLANPTDLVKVQMQMEGKRKLEGKPL^ 

NIQRAALVlsMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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FIGURE 127 

CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GC TGGC TGC TGAATGTAAGAGTGCAGGCTACCCCGGGACT T T GAT CCCC TACAGAT GT GACCTAT 
CA2\ATGAAGAGGACATCCTCTCCATGTTCTCAGCTATCCGTTCTCAGCACAGCGGTGTAGACATC 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGT TGAAT G TGAACGTGC TGGCCCT CAGCATC T GCACACGGGAAGCC TACCAGT C CAT GA 
AGGAGCGGAAT GT GGACGATGGGCACAT CAT TAACAT CAATAGCATGTC TGGCCACCGAGTGTTA 
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
AGACACAATTCGCCTTCAAACTCCACGACAAGGACCCTGAGAAGGCAGCTGCCACCTATGAGCAA 
ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CATCCAGAT T GGAGACATCCAGATGAGGCCCACGGAGCAGGT GACC TAGTGAC TGTGGGAGCT CC 
TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 
TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAA^ATGGGCTGGGGAAAGGAGGTGGTGTCCC 
TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCCTGGCCCAGTGGATTTCATGGTGATCATTAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 



MARPGMERWRDRLALVT GAS GG I GAAVARALVQQGLKWGCARTVGN I EELAAE CKS AG YPGT LI 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGLARPDTLL^ 

EAYQSMKERNVDDGHIIMINSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase . 

amino acids 30-42, 104-114 
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FIGURE 12Q 

AACTTCTACATGGGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAAATGAGGAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
T TAAT T T T TAAC T GATAGT TGTACATAT T TGGGGGTACAT GT GATATT TGGATACATGTATACAA 
!L. TATATAATGATCAAAT CAGGG TAAC TGGGATAT CCAT CACATCAAACAT T TAT T T T TTAT TCT T T 

fil T TAGACAGAGTCT CACTCT GTCACCCAGGC T GGAGTGCAGT GGT GCCAT CTCAGCTTACTGCAAC 

||! CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
W GCACCACAATGCCCAACTAAT TT T TGTATT T T TAGT AGAGACGGGGT T T TGCCAT GT T GCCCAGG 

y CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
Cl GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 

y * 

ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
£1 ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
|?1 AACCT C TGAT CACC TCAT TCT ACTC TCTACCT CCAT GAGAT CCAC T TT T T TAGCT CCCACATGTG 

AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
JSf TT CCAT CCAT GT TGCT GCAAATGACAGGAT T T CGTT CT TAAT T TCAAT TAAAATAACCACACATG 

f|| GCAAAAA 
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FIGURE 130 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 86-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE 131 

TT CTGAAGTAACGGAAGCTACCTTGT AT AAAG ACCT CAAC ACTGCT GACC ATGA T CAGGG CAGCCT GG AGC 
AT CT TCCT CAT CGGGACT AAAATT GGGC TGT T CCTT CAAGTAGCACCT CT AT CAGTT ATGGCTAAAT C CT G 
TC C ATCT GTGTGT CGCTGCG AT GCGGGT TT CAT T TACTGT AATGATCGCTTT CT GACATCCATT CC AACAG 
GAATACCAGAGGATGCTACAACTCTCTACCTTCAGAACAACCAAATAAATAATGCTGGGATTCCTTCAGAT 
TT GAAAAACT TGCT GAAAGT AGAAAGAAT ATACCT AT ACC AC AACAGT TT AGAT GAAT TT CCTACC AACC T 
CCCAAAGT AT GT AAAAGAGT T ACATT TGC AAGAAAAT AAC AT AAGG AC T ATC ACT T ATGATTC ACTTTC AA 
AAAT TCCC T ATCTGGAAGAATT AC ATTT AGAT GACAACTCTGTCT C TGCAGT T AGC AT AGAAGAGGGAGCA 
TT CCGAGAC AGC AACT ATCTCCGACT GCTT T T CCTGT CCCGT AAT C AC CT TAGCAC AATT CCCT GGGGTT T 
GCCCAGGACT AT AGAAGAACT ACGCT TGGAT G AT AAT C GC AT AT CC AC TATT TCATCACC ATCT CT TCAAG 
GT CT CACT AGT CT AAAACGC CT GGTT CT AGAT GGAAAC CTGT TG AACAATC ATGGTTT AGGT GACAAAGTT 
TT CT TCAACCT AGT TAATT T GACAGAGC TGT CCCTGGT GCGGAATTCC CT GACTGCTGCACC AGT AAACCT 
TCCAGGCACAAACCTGAGGAAGCTTTATCTTCAAGATAACCACATCAATCGGGTGCCCCCAAATGCTTTTT 
CT T ATCTAAGGC AGCT CT AT CG ACTGGAT ATGTC CAAT AAT AACCT AAGT AATT T ACCTC AGGGT AT CTTT 
GATG ATTT GGACAATAT AAC AC AACTGAT TCT TCGCAAC AATCCCT GGTATTGCGGGTGC AAGATGAAAT G 
GGT ACGT GACT GGTTACAAT CACT ACCT GTGAAGGTCAACGT GCGT GGGCTC ATGT GC CAAGCCCC AG AAA 
AGGTTCGT GGGATGGCTATT AAGGATCT CAATGCAGAACT GT TTGATT GT AAGGAC AGTGGGAT TGT AAGC 
ACCATTCAGAT AAC CACTGCAATACC CAAC AC AGT GTATCCT GC CC AAGGACAGTGGCCAGCTCC AGT GAC 
CAAACAGCCAGAT ATT AAGAAC CCCAAGCTCACT AAGG ATCAACAAAC CACAGGGAGT CCCTCAAG AAAAA 
CAAT TACAAT T ACT GT GAAGTCTGTC AC CTCT GATACC AT T CAT AT C TCTTGGAAACT TGCT CT AC CT AT G 
ACTG CTTT GAGACT CAGCT GGCTT AAAC TGGGCCAT AGCCCGGC AT T TGGAT CT AT AACAGAAAC AAT TGT 
AAC AGGGGAACGCAGT GAGT ACTTGGTC ACAGCCCTGGAGCCTGAT T C AC CC T ATAAAGT AT GCAT GGTTC 
CC AT GGAAACC AGC AACCTCT ACCT ATT TGAT GAAACT CCTGTT TGT AT T GAGACTGAAACT GCAC CC CT T 
CGAATGTACAACCCTACAACCACCCTCAATCGAGAGCAAGAGAAAGAACCTTACAAAAACCCCAATTTACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TT C ATAGG AATGGATCGCTCTT CT CAAGGAAC TGTGC AT ATAGC AAAGGGAGGAGAAG AAAGGAT GACT AT 
GCAGAAGC TGGCACT AAGAAGG AC AACTCTATCCTGGAAATCAGGGAAACTT CTT T TC AGAT GT T ACC AAT 
AAGC AAT GAACCCATCTC GAAGGAGGAGT T TGT AAT AC ACACCAT ATTTCCTCCT AAT GGAATG AATCTGT 
ACAAAAAC AAT C AC AGTGAAAGCAGT AGT AAC CGAAGC TACAGAGACAGT GGT AT T CC AGAC TC AG AT CAC 
TC AC ACTC ATGATGCT GAAGGACT C ACAGC AGACTTGT GT T TTGGGT TTT TT AAACCT AAGGGAGGTGAT G 
GT 
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FIGURE 132 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQ INNAG I PS DLKNLLKVERI YL YHNSLDE FPTNLPKYVKE LHLQENNI RT I T YDSLSKI PYL 
EELHLDDNSVSAVSIEEGAFRDSNYLRLLFLSRNHLSTIPWGLPRTIEELRLDDNRISTISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 
KVKTOGLMCQAPEKVRGMAIKDLNAELFDCKDSGIVST IQI TTAI PNTVYPAQGQWPAPVTKQPD 
IKNPKLTKDQQTTGSPSRKTITITVKSVTSDTIHISWKIiALPMTALRLSWLKLGHSPAFGSITET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
PYKNPNL PLAAI I GGAVALVT I ALLAL VCW YVHRNGS L FSRNCAYSKGRRRKDDYAE AGTKKDNS 
ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 

amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N- glycosylate on sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE laa 

CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
T GCAGAGGCAGT C TGGGCT T GGCCAGAGCTCAGGGTGCTGAGCG T GTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGC TGGAGAGAGCCACCGT CT T CCTAGAACAGAGGC T GCC T GAAATCAACC TGGATGGCAT GGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
C TCCAC TACC TCAAGC TGAGT GAT CCCAAGTACCT AAGAGAGT TCCAGCTGACCCTCCAGCCCGG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GACCACT CCAACAGAGCCAGGAC TATATCAACCT C T T CT GC GCCAACATGATGGACT TGAACCGC 
AGAGC TGAGGCCAT CGGATACGCC TACCCTACCCGGGACAT CTTCATGGAAAACATCAT GT TCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGCAGGAAGGATGCTTCGGGGAGCCTGATGCTGAAGATGAAGAATTATCTAAAGCTATTCAA 
TATCAGCAGCAT T T TT CGAGGAGAGTGAAGAGGCGAGAAAAACAAT TT CCAGATT CTCGCTCTGT 
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCTGAAAAGCACTTTCAAAGAGACTGTGTTGAATAAAGGGCCAAGGTTCTTGCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGC T T CC TATACAT CCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCAT CCACACCG 
CCACCACCAAGCAGCCGCT GAGACGGACGGT TCCATGC CAGC TGCC TGGAGGAGGAACAGACCCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGT GT GAACGGAT GAATAAAGT TCAAC TGCAAC TGAAAAAAAAAAA 
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FIGURE ISA 



feds 



5 ^ 



m 
o 



MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKATIADLILSALERATVFLEQRLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAI QRS LHYLKLS DPKYLRE FQL T LQPG FWKLPHAW I HTDASLVYPT FGPQDS FSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
Y INLFCANMMDLNRRAEAI GYAYP TRD I FMENIMFCGMGGFSDFYKLRWLEAI LSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 



Important features of the protein: 
(5s Signal peptide: 

2 .; t: 

W amino acids 1-2 6 



Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1. 

amino acids 230-237 



APP ID=10063716 



Page 288 of 322 



FIGURE 135 

GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGT T TT GT GGT T CAT GATATACCTT CT GGATCT TAT GTAGTGGAAGT TGTAT CT CC 
AGCT TACAGAT T T GAT CCCGTTCGAGTGGATATCACT T CGAAAGGAAAAATGAGAGCAAGATATG 
TGAAT TACAT CAAAACATCAGAGGTT GTCAGACT GCCC TAT CCTC TCCAAAT GAAAT CT T CAGG T 
CCACCTTCTTACTTTATTAAAAGGGAATCGTGGGGCTGGACAGACTTTCTAATGAACCCAATGGT 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
C TGACATGAGACGGGAAAT GGAGCAGTCAAT GAATAT GCTGAAT TCCAACCATGAGT T GCCT GAT 
GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 
TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGGTAGT CAGGCCGT CCAGAGC TGGCAT T TGCAC 
AAACACGGCAACACTGGGTGGCATCCAAGTCTTGGAAAACCGTGTGAAGCAACTACTATAAACTT 
GAGTCAT CCCGACGTT GAT C T CTTACAACTGT GTATGT T 

AAC T T T T TAGCACATGT T T T GTAC T T GGTACACGAGAAAACCCAGC TT TCATCT T T T GTCTGTAT 
GAGGTCAATATT GATGTCAC TGAAT TAATTACAGT GT CC TATAGAAAATGCCAT TAATAAAT TAT 
ATGAACTACTATACAT TAT GTATAT TAATTAAAACAT CT TAATCCAGAAATCAAAAAAAAAAAAA 
AAAAAAAAAAAAAAA 
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FIGURE 136 

MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAVVPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGSFVVHDIPSGSYVVEW 
WRLPYPLQMKSSGPPSYFIKRESWGWTDFLMNPMVMM^ 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

N-glycosylation site. 

amino acids 184-187 

Glycosaminoglycan attachment sites. 

amino acids '37-40, 236-239 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 



APPJD=10063716 



Page 290 



FIGURE 137 



GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAAC TCGCAC TCAGACTC TGAAC T CAGACCTGAAAT CT T CTC T TCACGGGAGGCT TGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
AAG C T G T G T GAT C GCC ACAAAC C T T CAGGAAATAC GAAAT GGAT T T T C T GAGAT AC G GGG CAGT G 
TGCAAGCCAAAGAT GGAAACAT TGACATCAGAAT CT TAAGGAGGAC TGAGTCT T TGCAAGACACA 
AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
S'l CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 
N AAGAAATACAGCCAGAT TCTGAGTCACT TTGAAAAGCTGGAACCTCAGGCAGCAGTTGTGAAGGC 

m 

T T T GGGGGAAC TAGACAT T C T T C T GC AAT GGAT GGAG GAG ACAG A ATAGG AGGAAAG T GAT GC T G 
CTGC TAAGAATAT T CGAGGT CAAGAGC TCCAGTCTT CAAT ACC TGCAGAGGAGGCAT GACCCCAA 
Ifl ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
G| C T TCCT TGCATGAT TGT CT T TAT GCATCCCCAAT CT TAATT GAGACCATAC T TGTATAAGATT T T 

Si TGTAATATCTTTCTGCTATTGGATATATTTATTAGTTAATATATTTATTTATTTTTTGCTATTTA 
ATGTATT TATT TT T T TAC T TGGACATGAAAC T T TAAAAAAAT TCACAGATTATAT T T ATAACCTG 
ACTAGAGCAGGTGAT GTATT T TTATACAGTAAAAAAAAAAAACCT TGTAAAT TC TAGAAGAGT GG 
CTAGGGGGGT T AT TCAT T TGTAT TCAAC TAAGGACATAT T TAC TCATGC TGAT GCT C TGT GAGAT 
ATT TGAAAT TGAAC CAAT GACTACT TAGGATGGGT T GTGGAATAAGT T TT GATGTGGAAT T GCAC 
ATCTACCTTACAAT TACT GACCATCCCCAGTAGACTCCCCAGTCC CAT AAT TGT GTATCTTCCAG 
CCAGGAATCCTACACGGCCAGCATGTATTTCTACAAATAAAGTTTTCTTTGCATACCAAAAAAAA 

AAAAAAAAAAA 



St*;? 
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FIGURE 15*8 



MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 

SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 

FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 

SLANS FLT IKKDLRLSHAHMTCHCGEEAMKKYSQI ^ 

E 

Important features of the protein: 
Signal peptide: 




amino acids 1-42 



cAMP- and cGMP-dependent protein kinase phosphorylation sites. 



"4 



amino acids 192-195, 225-228 



N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141 



f|l 
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FIGURE lftQ 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 
T T TGT GGC TGAAGCAAAGCCACC TACAGTTACT AT GCCTCGAATCAAGGCAT TGAT GACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGCTGGAAAAAGAATAGT CT T T TAT GGAGATGAAACC TGGGT TAAAT TA 
T TCCCAAAGCAT T T TGT GGAATATGATGGAACAACCTCAT T T T TCGTGTCAGATTACACAGAGGT 
GGATAATAATGTCACGAGGCATTTGGATAAAGTATTAAAAAGAGGAGATTGGGACATATTAATCC 
H TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
Q AAGC TGAGCGAGAT GGACAGCGT GCT GATGAAGAT CCACACC T CAC TGCAGTCGAAGGAGAGAGA 

f% GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
S GGGCC TCCTCCACCGAGGAGGTGAATACACCTCTGAT TT TAATCAGTTCTGCGTT TGAAAGGAAA 

^ I CCCGGTGATATCCGACATCCAAAGCACGTCCA ATAGA CGGATGTGGCTGCGACACTGGCGATAGC 
III AC T TGGC TTACCGAT T CCAAAAGACAGT GTAGGGAGCCT CCT AT TCCCAGTTGTGGAAGGAAGAC 

\\ CAATGAGAGAGCAGT T GAGAT T T T TACATT TGAATACAGTGCAGC T TAGTAAAC T GT TGCAAGAG 

5 <, AATGTGCCGTCATATGAAAAAGATCCTGGGTTTGAGCAGTTTAAAATGTCAGAAAGATTGCATGG 
i|| GAACT GGAT CAGAC TGTACT T GGAGGAAAAGCATT CAGAAGT CCTAT T CAACCT GGGCT CCAAGG 

^ ? TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
~ TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
Q CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
§p CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
%l GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
Jjf AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
61 TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
Q CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
•ft J TCGGACAGCC TCCCAGCAGAGGTGT GGGAGC T GCAGCTGAGGGAAGAAGAGACAAT CGGCC TGGA 

CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
AC TCAGATCCACAGAGCCCAGGAT CAAGGGACCCAC TGCAGT GGCAGCAGGACT GT T GGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CTGACCGAGACACTCACAGCTTTGTCATCAGGGCACAGGCTTCCTCGGAGCCAGGATGATCTGTG 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
CTGCACACAGTAT GT AGTT ACCAAAAGAATAAACGGCAAT AAT TGAGAAAAAAAA 
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FIGURE lao 

mrlgsgtfatccvaievlgiavflrgffpapvrssarm:hgaeppapepsagassnwttlppplf 
skwivlidalrddfvfgskgvkfmpyttylvekgashsfvaeakpptvtmprikalmt 
vdvi rnlns palleds virqakaagkri vfygde t wvkl fpkhfveydgt t s ffvs d ytevdnnv 
trhldkvlkrgdwdililhylgldhighisgpnspligqklsemdsvlmkihtslqskeretplp 
nllvlcgdhgmsetgshgassteevntplilissaferkpgdirhpkhvq 

Important features of" the protein: 



Signal peptide: 

y amino acids 1-34 

if a I Transmembrane domain: 

\| amino acids 58-76 

N-glycosylation sites. 

m amino acids 56-60, 194-198 



N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site. 

amino acids 154-158 



Cell attachment sequence. 

amino acids 205-208 
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FIGURE lai 

GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TCCCTATAGAAAACAACTGCCAGCACCT TAAGACCACT CACACC T TCAGAGT GAAGAAC T TAAAC 
C CGAAGAAATTCAGCAT TCATGACCAGGATCACAAAGTAC TGGTCCTGGAC T C T GGGAAT CT CAT 
AGCAGTTCCAGATAAAAACTACATACGCCCAGAGATCT TCTT TGCATTAGCCTCATCCT TGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGTGACAAGGATAAAGGACAAAGTCATCCATCCCT T CAGCT GAAGAAGGAGAAACTGAT GAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCGTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GTTGGGGTGACAGATAAATT TGAGAACAGGAAACACATT GAATTT TCAT TTCAACCAGTT TGCAA 
AGCTGAAATGAGCCCCAGTGAGGTCAGCGAT TAGG AAACTGCCCCATTGAACGCCTTCCTCGCTA 
ATT TGAACTAAT TGTATAAAAACACCAAACCTGCTCACT 
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FIGURE 142 

MLLLLLEYNFP I ENNCQHLKTTHT FRVKNLNPKKFS I HDQDHKVLVLDSGNL I AVPDKNY IRPE I 
FFALASSLSSASAEKGSPILLGVSKGEFCLYCDKDKGQSHPSLQLKKEKLMKIiAAQKESARRPFI 
FYRAQVGS WNMLE S AAHPGW FI CT SCNCNE PVGVTDKFENRKH I E FS FQPVCKAEMS PSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE 143 

CTAGAGAGTATAGGGCAGAAGGATGGCAGAT GAGTGACTCCACAT CCAGAGC T GCCTCCC T T TAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATT TCCACGAGCAAAGGGAC T GT GATGAACACAAT GT CATGGC T CGT TACCTCCCTGCCA 
CAGTGGAGTTTGCTGTCCACACATTCAACCAACAGAGCAAGGACTACTATGCCTACAGACTGGGG 
CACATCTTGAATTCCTGGAAGGAGCAGGTGGAGTCCAAGACTGTATTCTCAATGGAGCTACTGCT 
GGGGAGAAC TAGGTGT GGGAAATT T GAAGACGACATT GACAAC TGCCAT T TCCAAGAAAGCACAG 
AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 
CTCCTGAACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCCATGT 
GC TGCT CCCACAT T CCGTGGACAT CAGCAC TACT CT CCT GAGGAC T CTTCAGTGGCTGAGCAGC T 
T T GGAC TTGTTTGT TAT CCT AT T T T GC ATG T G T T T GAGAT CT C AGAT C AG TG T T T T AG AAAAT CC 
ACACATC TT GAGCCTAAT CAT GTAGTGTAGAT CAT TAAACAT CAGCAT T T TAAGAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAA&AAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE Mi 

MLGLPWKGGL S WALLLLLLGS QI LL I YAWHFHE QRDCDEHNVMARYL PATVE FAVHT FNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 
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FIGURE 

CTGTGCAGCTCGAGGCTCCAGAGGCACACTCCAGAGAGAGCCAAGGTTCTGACGCGATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
C T GAGGCCCAGGT GGC T GAGAACCGCCCGGGAGCC TT CATCAAGCAAGGCCGCAAGC TCGACAT T 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGT TCCAGAAGCCAGACAACAAGC TCCACCAGCAGGTGCTC T GG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 
CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
C C G T C AC TC CAT T C T C AG C C C C T AGC AGAGC G T C T GG C AC AC T AGAT TAG TAG T AAAT G C T T GAT 
GAGAAGAACACATCAGGCACTGCGCCACCTGCTTCACAGTACTTCCCAACAACTCTTAGAGGTAG 
GTGTAT T CC CGTT TTACAGATAAGGAAACTGAGGCCCAGAGAGCTGAAGTAC TGCACCCAGCATC 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
CT C TAATGAAAT T GTGAAAGC TCCATGT T TAGAAATAAATGAAAACACCTGA 
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FIGURE 146 

MRKHLSWWWIATVCMLLFSHLSAVQT 

LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites . 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 76-83 

N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 

GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAAC T T TCAGAT T CAGGGGGTACATGTGAAGGT T T GT T T TAT GAGTATAT TGCATGAT GCTGAGG 
TTTGGGGT 



fU 
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FIGURE 148 

MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA, 



Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-myristoylation site. 

amino acids 62-68 



m 

H 



ill 

I 

a 

m 



APP ID=10063716 



Page 302 of 322 



FIGURE 14Q 

GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAATCTCTACATAATAGAGAATTAAGT GCAGAAAGACCT TTGAATGAACAGAT TGCTGAA 
GCAGAAGAAGACAAGATTAAAAAAACATATCCTCCAGAAAACAAGCCAGGTCAGAGCAACTATTCTTT 
TGTTGATAACTTGAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAATTGAGAAAGAAAGACAATCTA 
TAAGAAGCT CCCCACT TGATAATAAGTT GAATGTGGAAGATGTTGATT CAACCAAGAATCGAAAACTG 
ATCGATGAT TAT GACT CTACTAAGAGTGGAT T GGATCATAAATT TCAAGAT GATCCAGAT GGTCTTCA 
TCAACTAGACGGGACT CCT TTAACCGCT GAAGACAT T GTCCATAAAAT CGCT GCCAGGATT TAT GAAG 
AAAATGACAGAGCCGTGTT TGACAAGAT TGT T TCT AAACT ACTTAATCTCGGCCT TATCACAGAAAGC 
CAAGCACATACACTGGAAGAT GAAGTAGCAGAGGT T TTACAAAAATTAATC T CAAAGGAAGCCAACAA 
T TATGAGGAGGAT CCCAATAAGCCCACAAGCTGGACT GAGAATCAGGCTGGAAAAATACCAGAGAAAG 
T GACT CCAATGGCAGCAAT TCAAGATGGT CTT GCTAAGGGAGAAAACGATGAAACAGTATCTAACACA 
TTAACCTTGACAAATGGCTTGGAAAGGAGAACTAAAACCTACAGTGAAGACAACTTTGAGGAACTCCA 
ATATTTCCCAAATTTCTATGCGCTACTGAAAAGTATTGATTCAGAAAAAGAAGCAAAAGAGAAAGAAA 
CACT GATTACTATCAT GAAAACACT GAT T GACTTTGT GAAGATGATGGTGAAATATGGAACAATATCT 
CCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCT 
AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 
CAGACAGTACCAAGGAAGAAGCAGCTAAGATGGAAAAGGAATATGGAAGCTTGAAGGATTCCACAAAA 
GATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGC 
CAT CAGAAAAAAT ATT GAATGGT TGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTAT GACCTTT 
CAAAGAT GAGAGACTTCATCAATAAACAAGCT GAT GCTTATGTGGAGAAAGGCATCC TTGACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATTTATAGCAGCCTGTAAAAATGGCAAAAGATCCAGGAGTCTTTCAA 
CTGTT TCAGAAAACAT AATAT AGCT TAAAACACTT CTAATTCTGTGATTAAAATT TTTT GACCCAAGG 
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC 
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAAAAA 
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FIGURE ISO 



MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYS FVDNLNLLKAI TEKEKIEKERQS I RS S PLDNKLNVEDVDS TKNRKL IDDYDS TKS GLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEi^YEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKTYSEDNFEELQYFPNFYALLKSIDSEKEAE<EKETLITIMKTLIDFVKMWKYGTISPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 
220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE 151 

CGGC T CGAGGCTCCCGCCAGGAGAAAGGAACAT T CTGAGGGGAGT CTACAC CC TGT GGAGC T CAA 
G ATGG TCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAAC CAGCTTC T AGC TGGAGGGC T GCATGCAGGGAAGGTCATTAAAGGT GAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCC TGT CATGT GGGGT GGGGCAGGAGCCGACT C TAACAC TAGAGCCAGT GAACAT CATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTA7\A 
^' CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
Cl TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
f! GGGTCAGTAGCT C TCCACAT GAAGTCC TGTCACT CACCACT GTGCAGGAGAGGGAGGT GGT CATA 

Sft GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
Zl TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
W GAAATGGC TCGAGCT CAGAAGATAAAAGATAAGTAGGGTATGCTGATCC TC T T TT AAAAACCCAA 

% 4 GATACAAT CAAAAT CCCAGAT GC TGGT C TC TAT T CCCAT GAAAAAGTGCTCATGACAT AT TGAGA 

i«b AGACCTACTTACAAAGTGGCATATATTGCAAT TTATTTTAATTAAAAGATACCTAT TTATATATT 

f|| TCTTTATAGAAAAAAGTCTGGAAGAGTTTACTTCAATTGTAGCAATGTCAGGGTGGTGGCAGTAT 
AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
: 3 „ AT TCC T T GTATAAAAAT AAGAAAAGAAAT TAATCTTGAGGT AAGCAGAGCAGACAT CAT CTCTGA 

0 TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
iff T T G TAG TAG T GAT CAGGAAACAGAT C T CAGC AAAGC CAC T GAG GAGGAGGC T G TG C T GAG T T T G T 

p| GTGGCTGGAAT CT CT GGGTAAGGAACT TAAAGAACAAAAATCAT CTGGTAAT T CT T T CCTAGAAG 

tjjl GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 
ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
.0 GTCTC TGCAGAT GTAGT TAGT TAAGACAAGGT CAT GCTGGAT GAAGGTAGACCTAAAT T CAATAT 

f|| GACTGGTTTCCTTGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGACTATGTA 
AAGATGAAGGCAGAGAT CGGAGTTTTGCAGCCACAAGCTAAGAAACACCAAGGATT GT GGCAACC 
ATCAGAAGCTTGGAAGAGGCAAAGAAGAATTCTTCCCTAGAGGCTTTAGAGGGATAACGGCTCTG 
CTGAAACCT TAAT CT CAGAC T TCCAGCCTCC T GAACGAAGAAAGAATAAATT TCGGC TGT T TTAA 
GCCACCAAGGATAATTGGTTACAGCAGCTCTAGGAAACTAATACAGCTGCTAAAATGATCCCTGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAGT GAT GGCATGCCACT T CCAAGAT T AGGT TATAAAAGACACT GCAGCT TC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGC TAGC TGCCAT GC TATGAGCAGGC C TATAAAGAGACT TACGTGGTAAAAAATGAAGT CTCCT 
GCCCACAGCCACAT TAGTGAACCTAGAAGCAGAGACTCTGTGAGATAATCGATGTT TGTTGTTT T 
AAGT TGCTCAGT TT TGGT CTAACT TGTTATGCAGCAATAGATAAATAATATGCAGAGAAAGAG 
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FIGURE 152 

MVLSGALC FRMKDSALKVLYLHNNQLLAGGLHAGKVI KGEE I S WPNRWLDAS LS PVI LGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins. 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE 153 



CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCAATGGCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCC TC T TGGTACAGGGAGGAGCAGC T GCGCCCAT CAGC TCCCACT GCAGGCT TGACAAGT CCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGT T CGT CTCAT TGGGGAGAAACT GT TCCACGGAGTCAGTATGAGTGAGCGCT GC T 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TAT TGAAGGTGAT GACC T GCATATCCAGAGGAAT GTGCAAAAGCTGAAGGACACAGT GAAAAAGC 
2 T TGGAGAGAGT GGAGAGATCAAAGCAAT T GGAGAACT GGAT TTGC TGTT TATGT C TC T GAGAAAT 

S GCC TGCATT TGACCAGAGCAAAGC TGAAAAAT GAATAACTAACCCCC T T TCCC TGCT AGAAATAA 

W 

ffl CAATTAGAT GCCCCAAAGCGAT TT T T TT TAACCAAAAGGAAGATGGGAAGCCAAAC T CCAT CATG 

|il ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 



AGAC CAGAAGG T AGAC T T T C T AAGCATAGAT AT T TAT T GAT AAC AT T T CAT T G T AAC TGGTGTTC 
TATACACAGAAAACAAT T TAT TT T T TAAATAAT TGT CT T T TT CCATAAAAAAGATTACTT TCCAT 
TC C T T T AGGGGAAAAAACC C C T AAAT AG C T T CAT G T T TCCAT AATCAGT ACT T TAT AT T TAT AAA 
TGTATT TAT TAT TATTATAAGAC TGCATTT TAT T TATATCAT T TTAT TAATATGGAT TTAT T TAT 
AGAAACATCATTCGATAT TGCTACT TGAGTGTAAGGCTAATATTGATAT TTATGACAATAATTAT 
AGAGC T ATAACATGTT TAT T T GACC TCAATAAACAC TTGGATATCCC 
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FIGURE lZd 

MAALQKS VS S FLMG TLAT S CLLLLALLVQGGAAAP I S SHCRLDKSNFQQP Y I TNRT FMLAKEASL 
ADNNTDWLIGEKLFHGVSMSERCYMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites. 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE 155 

GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCT GAGGGCT GT GCT GAGAGGGAGAGGAGCAGAGATGCT GC TGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGAT T AGGTGAGGACAGT T CTCTCAT TAGCC T T TT CC TACAGGTGGT TGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 
CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCCCAGCACAGGCACTTTCTAGATATTTCCCCCTTGCTGGAGAAGAAAGAGCCCCTGGTTTTATT 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
TTGACATGGATGAT TCTGAGGAGGAAGC TGTTATTGAATGTATAGAGATTTATCCAAATAAATAT 
CTTTATTTAAAAATGAAAAA 
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FIGURE 1*6 

MRERPRLGEDSS L I S L FLQWAFLAMVMGTHT YSHWPS CCPSKGQDT SEELLRWS T VPVPPLE PA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site, 

amino acids 136-140 

Tyrosine kinase phosphorylation site* 

amino acids 127-135 

N-myristoylation sites, 
amino acids 44-50, 150-156 
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FIGURE 157 



ill 



CCGGCGATGTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACT T GAGGGACC TCCGAGTAGAACCT GT TACAAC TAGTGT TGCAACAGGGGACTAT T CA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GATTTGTGTGACGGGCAAAAGCAACTTCCAGTCCTACAGCTGTGTGAGGTGCAATTACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCTGAACACAGTCTATTTCATTGGGGCCCATAATATTCCTAATGCAAATATGAATGAAGATGG 
CCCTTCCATGTCTGTGAATTTCACCTCACCAGGCTGCCTAGACCACATAATGAAATATAAAAAAA 
AGTGTGTCAAGGCCGGAAGCCTGTGGGATCCGAACATCACTGCTTGTAAGAAGAATGAGGAGACA 
GTAGAAGTGAACT T CACAACCAC T CCCC TGGGAAACAGAT ACATGGC TC T TATCCAACACAGCAC 
TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA 
TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
s \ AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
|p GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
i CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 
CI TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
*M TCACACAATT TGTTACT T CACTGAATT TC TTCAAAACCAT T GCAGAAGTGAGG T CAT CCT T GAAA 

jjSjJ AGT GGCAGAAAAAGAAAATAGCAGAGATGGGT CCAGTGCAGT GGC TTGCCAC TCAAAAGAAGGCA 

« GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
f|| GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
GTGATC TAAGAAGCCAGAT T CAT CT GCACAAATACGTGGT GGT CTAC TT TAGAGAGATT GATACA 
AAAGACGATTACAATGC T CTCAGT GTC TGCCCCAAGTACCACC TCATGAAGGATGCCAC TGCT TT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE 158 



MSLVLLSLAMiCRSAVPREPTVQCGSETGPSPEWIxnLQHDLIPGDLRDLRVEPVTTSVATGDYSIimfVSWV 

LRADASIRLLECATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 

NANMNEDGPSMSTOFTSPGCLDHIMKYKKKCVKAGSLWDPNITACKKNEETVEVNFTTTP 

STIIGFSQVFEPHQKKQTRASWIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 

SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIBCKTSFSTTTLLPPIKVLVVYPSEICFHHTICYFTEFL 

QNHCRSEVILEKWQKKKIAEMGPVQWLATQKKAADKVVFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 

FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 

GCCSL 



f'| Important features of the protein: 

Q Signal peptide: 

p! amino acids 1-14 

*H Transmembrane domain: 



amino acids 290-309 



N-glycosylation sites. 



gVf amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 



f?|. - 287 

C9 



cAMP- and cGMP-dependent protein kinase phosphorylation sites. 



Ill amino acids 228 - 232 and 319 - 323 



Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 



N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE i£Q 

AGCCACCAGCGCAAC ATGA CAGTGAAGACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 
AC TCAGC TGAAGAAG 
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FIGURE 160 



MTVKTLHGPAMVKYLLLSILGI^ 

QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGC INAQGKEDI SMNSVPIQQETLW 
RRKHQGCSVS FQLEKVLVTVGCTCVTPVIHHVQ 



-•Si" 

3 



HI 
s "is- 



Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 



Ksf amino acids 106-111, 136-141 
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FIGURE 161 



ACAC TGGCCAAACAAAAACGAAAGCACTCCGT GCT GGAAGTAGGAGGAGAGT CAGGACT CCCAGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCGTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCTGAAGAT GAGGAAAAGT T T GGAGGAGCAGCTGACTCAGGGGTGGAGGAGCCTAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
« CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
fl GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
ip CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
J*? CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
|0 CAGGACGTCCGCTCGGGGGCGGGCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
Q CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
PI CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
• ^ CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACTTAAATAAAGGCAGACGCTGTTTTTCTAAAAAAA 



If! 
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FIGURE 162 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWE E P EDEEKFGGAADSGVEE PRNASLQAQVVL S FQAY PTARC VLLE V 

qvpaalvqfgqsvgsvvyix:feaalgsevriwsytqpryebcelnhtqqlpalpwlnvsadgdnvhlvlots 
eeqhfglslywnqvqgppkprwhknltgpqiitlnhtdlvpclciqvwplepdsvrtnicpfredprahqn 

LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWAC PMDKY I HBCRWALVWLACLLFAAALSLI LLLKKDHAKGWLRLLKQDVRSGAAARG 
RAALLL YSADDSGFERLVGALASALCQL PLRVAVDLWS RRELSAQGPVAW FHAQRRQTLQEGGVWLL FSP 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence: 

amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-586 

cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



GGGAGGGCTCTGTGCCAGCCCCGATGAGGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACAT CC TGACGT GGGACAGCGGGCCAGAGGGCACCCCAGACACGGTC TACAG CAT C GAG TATA 
AGACGTACGGAGAGAGGGACT GGG TGGCAAAGAAGGGCT GTCAGCGGATCACCCGGAAGT CCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GTCAGTGCGGGAGGCCGGT CAGCCACCAAGAT GACT GACAGGT TCAGCT C TC TGCAGCACAC TAC 
CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGAT TCAGATGATTGT TCAT CCTA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
TTCTACCACT TAGAGCTCCAGGTCAACCGCACCTACCAAATGCACCT TGGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 
|4 TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCfCGTCGCAGTACTCTGCTACCTGAG 
fl CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
m TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
2? AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 
CP TCCACAGCGGCATAGCC T GTCCGAGATCACCTACT TAGGGCAGCCAGACAT CT CCAT CCTCCAGC 

pi CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
\| GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 
l2 GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
^ CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 
9* CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 
* TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 
pi CCC TGGGGAT T TGCACAGACAGAACATC TGACCCAAAT G T GCTACACAGTGGGGAGGAAGGGACA 

m CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
%l CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
W TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
|!| GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 
p AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
fjj ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 
AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 
AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAATGACAGTGCAAGGAGGAAATGCAGGGAAACTCCCGAGGTCCAGAGCCCCACCTC 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTG 
ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 
GCCTGGAAAAGAACCAGGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 
GAAAT GCAGG T CCACCAGGGAGGGAGACACACAAGCCTT T TC TGCAGGCAGGAGT T T CAGACCC T 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAZy\.TTCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
ATCTGTAATGTGGGGAT CAT AACACCTACCTCATGGAGTTGTGGTGAAGAT GAAAT GAAGT CAT G 
TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 
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FIGURE 164 
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m 



f 1 ! 
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MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNS LNVQRVLT FQPLRFI QEHVL I P VFDLSGPSS LAQPVQYS QIRVS GPRE PAGAPQRHS LSE I T 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 



Iff N-glycosylation sites. 

CI amino acids 80-83, 87-90, 172-175 



N~myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 



APP 



ID= 



10063716 
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FIGURE 16 s 

TGGCCTACTGGAAAAAAAAAAAAARAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAAC ATGG 
CTGCGGCGCCGGGGCTGCTCTTCTGGGTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCAGGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCGTTGAACTTTGGGCTGGA 
AGTGTTGAACACAGT TT TGGATATT TTCCAAAAGAT TTGATCAAGGTACTTCATAAATACACGGA 
AGAAGAGCTACATAT TCCAGCAGATGAGACAGACT TTGTCTGCTTTGAAGGAGGAAGAGATGATT 
T TAATAGT TATAATG TAGAAGAGC T T TTAGGATCT T T GGAACTGGAGGAC TCTGTACCTGAAGAG 
TCGAAGAAAGCTGAAGAAGTTTCTCAGCACAGAGAGAAATCTCCTGAGGAGTCTCGGGGGCGTGA 
ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
CAGAGAGCACCGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCACCCTCACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGAT GC TT ACAAAGTCCTGAAAACAGAAAT GAGT CAGAGAGGAAGTGGACAGTGCGT TAT T 
CAT TACAGCAAAGGATT TCGT T GGCAT CAAAAT CTAAGT T TGT T T TACAAAGAT TGT T TT TAGTA 
C TAAGC TGCC T TGGCAGT T T GCAT T T T TGAGCCAAACAAAAATATAT TAT T TTCCC T T CTAAGTA 
AAAAAAAAAAAAAAAAAAAA 
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FIGURE 166 



MAAAPGLLFWLFVXGALWWPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFW 
FKKGDDVYVYYKLAGGSLELWAGSVEHS FGYFPKDL IKVLHKYTEEELHI PADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMSQRGSGQCVIH YSKGFRWHQNLS L FYKDC F 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 




N-glycosylation site. 



amino acids 294-298 



Ml 



Tyrosine kinase phosphorylation site. 



amino acids 61-16 



N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 



Ami da ti on site. 



amino acids 28-32 
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FIGURE 167 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCGCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCTTGAGAATGAAGCAGAACAGAAGTTAATAGAGAGCATGTTGCAAAACCTGACAAAACCCGGG 
H ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
-Q * TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
Q ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACGAACTGCCAATCCTGGC 
lit CTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAAGCACAATTATATTTG 
III CAAGTATGAACCAGAGATTAATCCAACAGCCCCTGTAGAAAAGCCTTATCTTACAAATCAACCAG 
H GAGACACCCATCAGAATG TGGT TGT TACTGAAGCAGGTAT AAT TCCCAATCTAAT T TATGTTGT T 

U ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
CR GCATAZU^GTAAAGGAAGAACAAAAACTAGTCCAAACCAGTCTACACTGTGGATTTCAAAGAGTA 
^ CCAGAAAAGAAAGTGGCATGGAAGTATAATAACTCATTGACTTGGTTCCAGAATTTTGTAATTCT 
O GGATCTGTATAAGGAATGGCATCAGAACAATAGCTTGGAATGGCTTGAAATCACAAAGGATCTGC 
il AAGATGAACTGTAAGCTCCCCCTTGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCA 
0 TTTAAA.GAATATGGTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA 
if ACTTCAAACTTCAAGCAAATGAAATGGACAATGCAGATAAAGTTGTTATCAACACGTCGGGAGTA 
Cl TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
Rl TGTATATTGTATTGAAATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAA 
ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG 
AAAC T TAT T ACT GT TGT CAACT GAAT TCACAC ACAC ACAAATATAGTACC AT AGAAAAAG T T T G T 
TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
AATAAGAAGCTATTTCATTAAGTGTGATATAAACCTCCTCAAACATTTTACTTAGAGGCAAGGAT 
TGTCTAATTTCAATTGTGCAAGACATGTGCCTTATAATTATTTTTAGCTTAAAATTAAACAGATT 
TTGTAATAATGTAACTTTGTTAATAGGTGCATAAACACTAATGCAGTCAATTTGAACAAAAGAAG 
TGACATACACAATATAAATCATATGTCTTCACACGTTGCCTATATAATGAGAAGCAGCTCTCTGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
ACTATATTAGTATACAAAGAGGTCATGTGGTTGAGAGCAGGTGAATAGTCACTATCAGTGTGGAG 
ACAAGCACAGCACACAGACATTTTAGGAAGGAAAGGAACTACGAAATCGTGTGAAAATGGGTTGG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
A?\AAAAAAZy\AAAAAAAAAAAAAM^AAAAAAAA^ 
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FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRW 

GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCVVMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVE 
PYLTNQPGDTHQNVVVTEAGI I PNLI YWIPTI PLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ 
STLWISKST RKE S GME V 

Important features of the protein: 
Signal peptide : 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 

N-glycosylation sites. 

amino acids 86-89, 255-258 

cAMP- and cGMP^dependent protein kinase phosphorylation site* 

amino acids 266-269 

N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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